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INTRODUCTION 

The  primary  objective  of  this  project  is  to  carry  out  secondary  analyses  of  data  on  risk-protective 
factors  for  workplace  violence  perpetration-victimization  in  the  Army  Study  to  Assess  Risk  and 
Resilience  in  Servicemembers  (A-STARRS),  the  largest  epidemiological  study  of  mental  health 
risk  and  resilience  ever  conducted  among  US  Army  personnel.  Although  the  primary  focus  of  A- 
STARRS  is  suicide,  much  information  also  exists  on  other  topics,  one  of  them  being  violence. 
Using  A-STARRS  datasets,  we  will  estimate  the  prevalence  and  predictors  of  workplace 
violence  perpetration  and  victimization  in  order  to  develop  risk  prediction  tools  that  can  be  used 
by  the  Army  to  target  soldiers  at  high  risk  of  workplace  violence  perpetration  or  victimization.  A 
secondary  objective  is  to  use  the  results  of  these  analyses  to  expand  knowledge  about  modifiable 
risk  and  protective  factors  for  workplace  violence  perpetration  and  victimization  in  the  Army. 
The  latter  information  could  be  of  value  to  the  Army  to  develop  or  modify  preventive 
interventions  for  workplace  violence  perpetration  and  victimization. 

KEYWORDS 

U.S.  Army;  new  soldiers;  workplace  violence;  physical  violence;  sexual  violence;  perpetration; 
victimization;  prevalence;  socio-demographics;  risk  factors;  protective  factors;  administrative 
data;  survey  data;  stepwise  logistic  regression;  elastic  net  penalized  regression;  cross-validation; 
machine  learning;  data  mining;  risk  algorithm;  risk  score 

OVERALL  PROJECT  SUMMARY 

Due  to  the  Year  1  delay  in  obtaining  the  Army  Omnibus  DUA,  work  and  spending  were  also 
delayed.  We  continue  to  work  to  catch  up  to  our  original  timeline.  The  current  report  therefore 
updates  all  Specific  Aims  for  Years  1  and  2  because  our  work  this  year  included  much  of  the 
work  planned  for  Year  1.  Note  that  spending  was  also  delayed  until  work  began,  causing  a 
substantial  carryforward  from  Year  1  to  Year  2,  and  a  slightly  lower  carryforward  from  Year  2  to 
Year  3.  Our  Current  Objectives  are  described  within  the  Specific  Aims,  where  we  also  discuss  in 
detail  results  from  numerous  analyses  we  have  performed  over  the  past  year.  Due  to  space 
constraints,  all  Main  Tables  are  included  in  the  appendices  rather  than  embedded  in  text. 

Specific  Aim  1:  Merge  data  on  workplace  violence  across  four  administrative  datasets  and 
generate  descriptive  epidemiological  data  on  prevalence/socio-demographics  of  workplace 
violence  in  the  Historical  Administrative  Data  System  (HADS)  2004-2009.  Distinguish  violent 
acts  in  terms  of  content  and  as  either  investigated  but  not  charged,  charged  but  not  founded,  or 
founded. 

Update:  In  Year  1,  we  generated  a  coding  scheme  to  classify  offense  types  and  identify  which 
offenses  should  be  categorized  as  workplace  violence  by  reviewing  a  number  of  alternative 
classification  schemes.  We  finally  settled  on  the  Bureau  of  Justice  Statistics  National  Corrections 
Reporting  Program  (NCRP)  classification  system  (United  States  Department  of  Justice).  We 
categorized  offense  types  into  violent  and  non-violent  crimes  and  also  distinguished  between 
crimes  related  to  family  (e.g.,  domestic  violence)  versus,  crimes  not  related  to  family.  We 
consider  all  crimes  that  are  both  violent  and  non-familial  as  “workplace  violence,”  as  these  are 
the  violent  crimes  that  were  committed  by  Regular  Army  personnel  while  on  active  duty. 
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In  Year  2,  after  doing  a  review  of  the  literature,  we  slightly  reorganized  the  eategorization  of 
erirnes  that  was  presented  in  the  Year  1  report.  We  moved  simple  assaults  and  “other  physical 
violence”  to  a  category  of  minor  physical  violence  offenses.  We  also  divided  verbal  violence  into 
minor  verbal  physical  offenses  (e.g.,  intimidation  and  harassment)  and  verbal  sexual  violence 
(e.g.,  sexual  harassment;  referred  to  as  “minor”  sexual  violence).  Cross-tabulations  were  then 
used  to  estimate  standardized  rates  of  the  new  set  of  violence  outcomes.  See  attached  Main  Table 
1  for  the  rates  of  perpetration  and  victimization  of  violent  crime  per  1,000  person-years  (PY)  in 
the  HADS  2004-2009  data  using  the  new  categorization.  Whereas  the  Year  1  report  only 
presented  rates  in  the  total  sample,  rates  are  now  also  presented  separately  for  male  and  female 
soldiers.  Note  that  the  vast  majority  of  all  crimes  are  “founded.”  After  looking  carefully  at  these 
standardized  rates,  we  decided  to  continue  to  focus  on  “founded”  crimes  for  the  perpetration 
outcomes  of  interest.  For  victimization,  we  decided  to  define  our  outcomes  based  on  any 
reported  victimization  (i.e.,  since  the  offender  may  not  always  be  known  or  charged). 

Aggravated  assaults  made  up  the  vast  majority  of  major  physical  violent  crimes  (1.6  “founded” 
perpetrations  and  1.0  victimizations  per  1,000  PY  in  the  total  sample),  simple  assaults  were  the 
primary  form  of  minor  physical  violence  (9.1  perpetrations  and  7.6  victimizations  per  1,000  PY), 
and  rape/sodomy/sexual  assaults  (2.3  perpetrations  and  1.8  victimizations  per  1,000  PY  in  the 
total  sample)  was  more  common  than  verbal  sexual  violence  (0.3  perpetrations  and  0.1 
victimizations  per  1,000  PY  in  the  total  sample). 

Based  on  these  standardized  rates,  we  prioritized  which  workplace  violence  (i.e.,  non-familial) 
outcomes  for  which  we  would  develop  risk  prediction  tools.  Specifically,  we  identified  outcomes 
that  were  frequent  enough  to  produce  stable  prediction  equations  among  Regular  Army  soldiers: 
major  physical  violence  perpetration  and  victimization  (i.e.,  excluding  sexual  violence)  among 
males,  females,  and  in  the  total  population;  perpetration  of  major  (e.g.,  rape,  sexual  assault) 
sexual  violence  among  males;  victimization  of  sexual  violence  among  males,  females  and  in  the 
total  population;  and  perpetration  and  victimization  of  minor  physical  violence  among  males, 
females  and  in  the  total  population.  We  decided  not  to  include  sexual  verbal  violence  because 
the  frequency  of  perpetration/victimization  was  too  low  to  obtain  a  stable  prediction  model. 

Specific  Aim  2:  Analyze  longitudinal  profiles  of  recurrence  of  administratively-recorded 
workplace  violence  perpetration  and  victimization  in  the  HADS. 

Update:  Our  Year  1  report  presented  a  table  of  recurrence  of  administratively  recorded 
perpetration  of  workplace  violence.  We  have  now  updated  this  table  to  correspond  to  the  final 
set  of  outcomes  described  in  Specific  Aim  1.  Disaggregation  of  the  rates  from  Main  Table  1 
show  that  the  percent  of  active  duty  Regular  Army  soldiers  in  service  at  any  time  between  2004- 
2009  who  ever  during  this  period  were  accused  of  perpetration  of  major  physical,  minor 
physical,  or  major  sexual  workplace  (i.e.,  non-familial)  violence  were  0.6%,  2.9%,  and  0.6%, 
respectively  (see  attached  Main  Table  2).  Roughly  one-quarter  of  perpetrations  of  major  physical 
violence  are  recurrences  rather  than  first  offenses,  and  roughly  one-third  of  perpetrations  of  both 
minor  violence  and  major  sexual  violence  are  recurrences.  Similar  patterns  of  perpetration 
recurrence  are  seen  only  among  male  soldiers.  In  contrast,  female  soldiers  had  lower  rates  of 
perpetration  recurrence  (i.e.,  15-20%  of  all  major  physical,  minor  physical,  and  major  sexual 
perpetrations  by  females  were  recurrences).  The  percent  of  active  duty  Regular  Army  Soldiers  in 
service  who  were  ever  victims  of  major  physical,  minor  physical,  or  major  sexual  violence  were 
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0.4%,  1.7%  and  0.5%  respectively.  6.3%  of  all  soldiers  who  were  victims  of  major  physical 
violence  were  victimized  more  than  once,  1 1.3%  were  repeated  victims  of  minor  physical 
violence,  and  16.8%  were  repeated  victims  of  major  sexual  violence.  Similar  victimization 
estimates  were  observed  separately  in  male  and  female  soldiers.  Given  that  rates  of  single  (i.e., 
first)  perpetrations  and  victimizations  were  much  higher  than  recurrences,  we  decided  to 
prioritize  the  development  of  risk  algorithms  that  predict  the  first  occurrences  of  perpetration  or 
victimization  of  our  workplace  violence  outcomes  of  interest.  However,  we  also  plan  to 
eventually  develop  risk  prediction  tools  for  reoccurrences  of  perpetration  and  victimization. 

Specific  Aim  3:  Compare  violence  perpetration-victimization  rates  with  consenting  soldier 
administrative  data  for  soldiers  in  the  A-STARRS  survey  sample  2011-2013. 

Update:  In  Year  1  we  were  delayed  in  receiving  the  updated,  corrected  consenting  soldier 
administrative  data  for  the  A-STARRS  survey  samples.  In  Year  2  we  have  focused  our  analysis 
for  this  aim  on  survey  data  from  the  New  Soldier  Study  (NSS).  The  NSS  interviewed  new 
soldiers  recently  arriving  to  Basic  Combat  Training  occurred  between  April  2011  and  December 
2012,  and  we  have  person-month  outcome  data  up  until  December  2013.  Thus,  the  maximum 
number  of  months  of  follow-up  for  NSS  respondents  is  32  months  (for  soldiers  interviewed  in 
April  2011)  while  the  minimum  number  of  months  follow-up  is  12  months  (for  soldiers 
interviewed  in  December  2012).  We  consequently  developed  a  weight  for  the  NSS  sample  so 
that  we  could  compare  NSS  rates  of  perpetration  and  victimization  to  rates  in  the  2004-2009 
HADS.  This  weight  made  adjustments  so  that  survey  responders  who  completed  the  NSS  and 
gave  permission  to  link  to  their  administrative  data  were  representative  of  all  soldiers  who  took 
the  NSS  survey  regardless  of  whether  they  completed  or  gave  administrative  data.  The  weight 
also  made  adjustments  so  that  the  NSS  sample  would  be  representative  of  the  total  Army 
population  2011-2013.  However,  it  is  important  to  note  that  the  Army  provided  us  with  a  limited 
number  of  covariates  to  develop  our  weight.  For  example,  we  were  given  no  information  on 
perpetration  and  victimization  of  crime  in  the  total  population  from  2011-2013.  Thus,  it  is 
possible  there  is  bias  in  the  NSS  rates  presented  here.  Nonetheless,  we  used  the  best-available 
weight  to  compare  rates  of  perpetration  and  victimization  of  workplace  violence  in  the  2011- 
2013  NSS  sample  to  the  2004-2009  HADS  sample. 

Main  Table  3  (attached)  shows  the  rate  of  first  occurrences  of  perpetration  and  victimization  of 
major  physical,  minor  physical,  and  major  sexual  workplace  (i.e.,  non-familial)  violence  per 
1,000  PY  in  the  NSS  (April  2011-  Dec  2012)  compared  to  soldiers  from  the  2004-2009  HADS 
whose  length  of  time  in  the  Regular  Army  was  32  months  or  fewer  (i.e.,  the  maximum  number  of 
NSS  follow-up  months).  The  rates  of  major  sexual  violence  perpetration  and  victimization  are 
comparable  in  the  NSS  (perpetration  =  2.7/1,000  PY;  victimization  =  4.1/1,000  PY)  and  HADS 
(perpetration  =  2.7/1,000  PY;  victimization  =  3.9/1,000  PY).  However,  rates  of  physical 
violence  perpetration  and  victimization  (both  major  and  minor)  are  significantly  lower  in  the 
NSS  (perpetration  =  1.8/1,000  PY  to  6.8/1,000  PY;  victimization  =  0.8/1,000  PY  to  4.5/1,000 
PY)  than  the  2004-2009  HADS  (perpetration  =  2.9/1,000  PY  to  13.1/1,000  PY;  victimization  = 
2.2/1,000  PY  to  7.8/1,000  PY).  Similar  patterns  were  observed  separately  among  male  and 
female  soldiers. 
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Specific  Aim  4:  Generate  deseriptive  epidemiologieal  data  on  prevalenee/soeio-demographie 
eorrelates  of  self-reported  workplaee  violenee  perpetration/vietimization  in  the  A-STARRS 
surveys. 

Update:  We  have  ereated  the  outeomes  for  self-reported  workplaee  violenee  perpetration  and 
vietimization  in  the  All- Army  Study  (AAS),  whieh  uses  a  sample  of  active  duty  Regular  Army 
soldiers  not  deployed,  and  in  the  post-deployment  phase  of  the  Pre-Post  Deployment  Study 
(PPDS)  of  three  Brigade  Combat  Teams  assessed  just  before  deployment  and  then  again  just 
after  returning  from  deployment  to  Afghanistan.  In  Year  1,  we  reported  the  prevalence  of  self- 
reported  workplace  violence  from  the  AAS  and  PPDS  using  a  preliminary  weighted  AAS  dataset 
from  April  201 1 -December  2011  (only  the  first  3  quarters  of  the  AAS)  that  had  not  yet  been 
consolidated  with  the  AAS  In- Theater  data  or  National  Guard/ Army  Reserve  data.  We  also  used 
a  preliminary  PPDS  file  that  hadn’t  been  updated  with  the  final  cohort  of  consenting  soldiers  (at 
the  time  of  the  Year  1  report  we  were  still  collecting  follow-up  data  for  our  final  time  point). 

The  final  AAS  and  PPDS  datasets  were  created  this  past  summer  and  we  now  report  the  updated 
rates  of  self-reported  workplace  violence  in  these  surveys  (see  attached  Main  Table  4).  Both 
surveys  asked  respondents  to  report  how  often  they  were  verbally  violent  in  the  past  30  days 
(i.e.,  either  yelled,  insulted,  swore,  or  threatened  someone;  had  a  heated  argument;  or  got  into  a 
loud  argument  in  a  public  place).  17.2%  of  respondents  in  the  AAS  and  13.8%  of  those  in  the 
PPDS  reported  perpetrating  at  least  one  form  of  verbal  violence  over  this  time  period.  The  survey 
also  included  a  question  about  “physical  confrontation”  during  an  argument.  A  total  of  3.0%  of 
AAS  respondents  and  also  3.1%  of  PPDS  respondents  reported  this  kind  of  experience  occurring 
in  the  30  days  before  interview.  Respondents  were  also  asked,  unrestricted  by  time,  whether  they 
had  ever  hit  another  person  to  the  point  of  bruising  and/or  healthcare  needs.  2.4%  of  respondents 
in  the  AAS  and  6.2%  of  those  in  the  PPDS  reported  such  experiences.  As  expected,  male  soldiers 
virtually  always  had  higher  rates  of  these  self-reported  perpetration  behaviors  than  female 
soldiers. 

The  AAS  and  PPDS  also  asked  a  series  of  questions  about  victimization  (see  Main  Table  4). 
These  included  questions  about  experiences  of  physical  assault  and  rape  over  the  past  12  months 
as  well  as  ever  during  a  deployment.  Past  year  physical  assault  victimization  was  reported  by 
1.1 -1.2%  of  respondents  (across  the  two  surveys)  and  past  year  sexual  assault  victimization  by 
0.2-0. 5%.  In  regard  to  experiences  during  deployment,  1. 3-2.1%  reported  physical  assault 
victimization  during  deployment  (across  the  two  surveys),  0.3- 1.0%  reported  sexual  assault 
victimization,  and  3. 8-4.1%  reported  being  bullied  by  unit  members.  Whereas  self-reported  rates 
of  physical  assault  victimization  were  generally  higher  among  men,  rates  of  sexual  assault 
victimization  and  bullying  were  consistently  higher  among  women. 

We  then  looked  at  socio-demographic  and  career  history  correlates  of  self-reported  perpetration 
and  victimization  (see  attached  Main  Table  5).  We  had  to  collapse  responses  across  certain 
outcome  items  so  that  the  coefficients  would  be  stable  (e.g.,  perpetration  of  any  verbal  violence; 
perpetration  of  any  physical  violence;  victim  of  any  violence).  These  results  are  presented  in 
Main  Table  5.  Overall,  significant  associations  were  fairly  consistent  across  the  AAS  and  PPDS. 
Most  were  also  consistent  with  expectations,  thus  we  do  not  discuss  them  all  in  detail  here.  For 
example,  females  had  significantly  lower  odds  of  reporting  violence  perpetration  (ORs  =  0.5-0. 7) 
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but  significantly  higher  odds  of  reporting  victimization  (ORs  =  1.6-2. 5).  Soldiers  over  30  years 
old  had  signifieantly  lower  odds  of  perpetrating  violenee  (ORs  =  0. 1-0.5)  but  higher  odds  of 
vietimization  (ORs  =  3.5-14.1).  A  similar  pattern  was  found  for  months  in  aetive  and  non-aetive 
service;  soldiers  with  86+  months  in  service  were  at  signifieantly  lower  odds  of  self-reported 
perpetration  (ORs  =  0.5-0. 7)  but  higher  odds  of  being  a  victim  (ORs  =  3. 6-5. 3).  The  only 
signifieant  assoeiation  for  raee-ethnicity  was  for  self-reported  verbal  violence  in  the  AAS; 
soldiers  identifying  as  “other”  race  were  at  lower  odds  of  verbal  violenee  perpetration  than  Non- 
Hispanic  White  soldiers.  Interestingly,  these  findings  for  race  contrast  those  found  in  our  HADS 
and  NSS  models  that  defined  perpetration  outcomes  using  administrative  erime  reeords  (see 
Speeific  Aims  7  and  8  below).  This  suggests  possible  bias  either  in  the  likelihood  of  arrest  and/or 
the  likelihood  of  self-reporting  violenee  perpetration.  We  plan  to  further  examine  these  potential 
biases  over  the  next  year  as  we  pursue  Speeific  Aim  5. 

Specific  Aim  5:  To  study  patterns-predictors  of  under-reporting  consenting  soldier 
administrative  data  system  reports  of  workplaee  violenee  vietimization  in  the  AAS  and  PPDS 
samples  by  comparing  self-reported  with  administratively-recorded  vietimization.  Develop 
correction  procedures  and  a  computer  program  for  Army  leadership  to  use  in  adjusting  for  under¬ 
reporting  in  future  analyses  of  Army  administrative  databases. 

Update:  Given  the  delays  in  obtaining  administrative  data  in  Year  1,  we  have  not  yet  been  able 
to  link  AAS  and  PPDS  self-reported  responses  to  administrative  criminal  records.  Thus,  no 
progress  has  been  made  on  this  aim. 

Specific  Aim  6:  Analyze  longitudinal  profiles  of  reeurrenee  of  administratively-reeorded  and 
self-reported  workplace  violence  in  the  AAS  and  PPDS  surveys. 

Update:  Given  the  delays  we  experieneed  obtaining  administrative  data  in  Year  1,  we  have  not 
yet  been  able  to  link  AAS  and  PPDS  responses  to  administrative  eriminal  records.  Thus,  no 
progress  has  been  made  on  this  aim. 

Specific  Aim  7:  Use  data  mining  methods  to  develop  predietion  equations  for  HADS  predietors 
of  administratively-reeorded  workplaee  violenee  perpetration-victimization  in  the  HADS  dataset. 
Parallel  prediction  equations  will  be  estimated  for  a  number  of  different  types  of  violenee  to 
determine  if  the  signifieant  predietors  vary  depending  on  the  severity,  persistence,  or  eharacter  of 
the  violent  aets.  Cross-validate  final  predietion  equations.  Develop  a  computer  program  for 
Army  leadership  to  generate  risk  seores  for  individual  soldiers  based  on  HADS  profiles. 

Update:  We  spent  the  majority  of  Year  2  working  on  Speeific  Aims  7  and  8  and  have  made 
substantial  progress.  In  regard  to  Aim  7,  we  prioritized  the  development  of  HADS  algorithms  to 
predict  the  first  oeeurrence  of  10  outeomes:  major  physical  workplace  violence  perpetration 
(separately  among  male  and  female  soldiers  and  also  in  the  total  sample),  major  physieal 
workplaee  violenee  vietimization  (males;  females;  total  sample),  major  workplaee  sexual 
violence  vietimization  (males;  females;  total  sample),  and  major  workplace  sexual  violenee 
perpetration  (among  males  only).  We  are  still  in  the  proeess  of  analyzing  the  models  for  first 
perpetration  and  victimization  of  minor  violence  (men;  women;  and  total  sample)  and  these 
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results  are  not  ineluded  in  this  report.  We  also  plan  on  developing  parallel  models  in  the  future 
that  predict  the  recurrence  of  workplace  violence. 

Developing  the  prediction  equations  has  been  a  long  and  arduous  task.  It  has  involved 
consultation  with  many  statisticians  and  experts  in  the  field  of  data  mining  to  develop  a 
methodology  that  would  identify  not  only  the  optimal  area  under  the  receiver  operating 
characteristic  curve  (AUC)  and  concentration  of  risk  (CR,  i.e.,  the  percent  of 
perpetrations/victimizations  occurring  among  soldiers  with  the  highest  predicted  risk 
probabilities)  but  also  provide  a  list  of  the  interpretable  predictors  that  could  later  be  used  to  “dig 
down”  into  the  theoretical  implications  of  the  models.  It  is  also  noteworthy  that  the  data  mining 
methods  that  were  ultimately  used  required  substantial  computing  power  (e.g.,  cross-validation 
models  could  take  as  long  as  one  week  to  converge). 

HADS  analyses  began  by  defining  the  administrative  outcomes  in  the  2004-2009  data, 
generating  person-year  samples  for  each  of  the  10  outcomes,  and  reviewing  bivariate 
associations  with  four  types  of  potentially  important  administrative  predictors  of  workplace 
violence  perpetration  and  victimization  (i.e.,  socio-demographic  factors,  military  career 
experiences,  prior  crime  experiences,  and  health/stress  [mental  and  physical]).  Several  hundred 
significant  predictors  were  found  in  these  bivariate  models,  many  of  which  were  consistent  with 
study  hypotheses.  Accordingly,  below  we  review  some,  but  not  all,  of  the  significant  bivariate 
associations.  Tables  for  these  bivariate  results  are  also  available  upon  request. 

HADS  bivariate  models.  In  the  total  sample,  perpetration  of  major  physical  violence  was 
significantly  inversely  related  to:  age  at  enlistment,  current  deployment,  rank,  time  in  service, 
AFQT  score,  age,  education,  and  number  of  dependents.  It  was  also  positively  associated  with 
being  male,  in  the  infantry,  unmarried,  Non-Hispanic  Black  or  Hispanic,  having  prior 
perpetration  or  victimization  of  other  violent  crimes,  having  a  positive  drug  test  in  the  past  3 
months,  having  healthcare  visits  for  impulse  control  disorders,  taking  medications  for  anxiety, 
chemical  dependency,  or  cognitive  disorders,  having  registered  guns,  percentage  of  combat 
deaths  and  suicide  deaths  in  one’s  unit  over  the  past  year,  and  number  of  different  duty  units  over 
the  past  year.  Bivariate  associations  with  victimization  of  major  physical  violence  were  very 
similar  to  those  of  perpetration.  Inverse  associations  were  found  for  age  of  enlistment,  rank,  time 
in  service,  AFQT  score,  current  age,  education,  and  number  of  dependents.  Positive  associations 
were  found  for  being  never  married,  Non-Hispanic  Black,  and  female,  prior  victimization  of 
minor  physical,  major  sexual,  and  sexual  verbal  violence,  prior  perpetration  of  minor  crime  and 
drunkenness/disorderly  conduct,  having  a  positive  drug  tests  in  the  past  3  months,  having 
healthcare  visits  for  impulse  control  disorders,  and  using  medications  for  anti-anxiety,  chemical 
dependency,  and  bipolar  disorder. 

We  also  found  similar  significant  bivariate  associations  with  perpetration  and  victimization  of 
major  sexual  violence.  Among  male  soldiers,  perpetration  of  major  sexual  violence  among 
males  was  significantly  inversely  related  to  age  at  enlistment,  current  deployment,  rank,  time  in 
service,  AFQT  score,  age,  education,  number  of  dependents,  being  never  being  married,  and 
Non-Hispanic  Black  and  Hispanic.  Perpetration  of  sexual  violence  was  also  positively  associated 
with  being  in  the  infantry,  prior  perpetration  and  victimization  of  violent  crime,  being  prescribed 
sedative-hypnotics  or  medications  for  chemical  dependency,  healthcare  visits  for  impulse  control 
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disorder,  hospitalizations  for  suicidal  behaviors  in  the  past  12  month,  and  number  of  different 
duty  units  over  the  past  12  months.  In  regard  to  bivariate  associations  with  major  sexual  violence 
victimization  in  the  total  sample,  significant  positive  associations  were  found  for  being  female, 
Non-Hispanic  White,  prior  victimization  of  physical  violence,  prior  perpetration  of  sexual 
violence,  prior  drunkenness/disorderly  conduct,  having  a  positive  drug  test  in  the  past  3  months, 
and  treatment  for  impulse  control  disorders  and  chemical  dependency. 

Using  data  mining  to  develop  the  HADS  risk  equations.  Importantly,  there  were  strong  inter¬ 
correlations  among  the  400+  HADS  predictors,  making  it  impossible  to  include  all  the 
individually  significant  predictors  in  a  single  logistic  regression  equation  for  each  workplace 
violence  outcome.  Our  data  mining  approach  consequently  used  (i)  cross-validated  stepwise 
logistic  regressions  to  identify  a  final  parsimonious  set  of  predictors  for  each  outcome,  (ii)  cross- 
validated  random  forests  to  search  for  interactions  among  predictors,  and  (iii)  elastic  net 
penalized  regressions  to  stabilize  predictor  coefficients  (i.e.,  due  to  multicollinearity).  This  data 
mining  approach  is  very  similar  to  what  we  have  been  using  to  develop  suicide  risk  algorithms  in 
A-STARRS  (see  Kessler  et  ah,  in  press  attached  as  an  Appendix).  We  started  by  running  cross- 
validated  stepwise  logistic  regressions  to  determine  the  best  number  of  predictors  for  each 
outcome.  In  addition  to  running  cross-validated  models  that  did  not  restrict  the  number  of 
predictors  allowed  to  “step  in,”  we  also  specified  several  increasingly  restrictive  models  to 
reduce  the  number  of  predictors  to  a  more  manageable  number  (e.g.,  10,  15,  20,  25,  50,  or  75 
predictors).  The  AUCs  and  CRs  derived  from  cross-validated  stepwise  regression  models  were 
then  evaluated  in  order  to  select  the  final  set  of  predictors  for  each  outcome  that  resulted  in  the 
highest  classification  accuracy  and  optimal  CR.  This  was  done  for  all  10  workplace  violence 
outcomes,  and  results  suggest  that  the  models  restricted  to  allow  25  predictors  (i.e.,  for  major 
physical  violence  perpetration  among  males  and  in  the  total  sample)  or  20  predictors  (i.e.,  for 
major  physical  violence  perpetration  among  females;  major  physical  violence  victimization 
among  males,  females,  and  in  the  total  sample;  major  sexual  violence  perpetration  among  males; 
and  major  sexual  violence  victimization  among  males,  females,  and  in  the  total  sample)  had  the 
highest  classification  accuracy  (AUCs  =  0.7  to  0.9)  and  largest  proportions  of  the  particular 
outcome  occurring  among  those  in  the  top  5%  of  predicted  risk  (CRs  =  21.3%  to  72.9%). 

After  selecting  the  best  cross-validated  stepwise  logistic  regression  for  each  outcome  we  then  ran 
a  non-cross-validated  model  restricted  to  the  optimal  number  of  predictors  (i.e.,  20  or  25 
predictors  depending  on  the  outcome)  and  output  the  odds  ratios  (ORs).  We  did  this  because  of 
our  interest  in  the  theoretical  implications  of  the  models  (see  Appendix  Tables  1-10  for  the  list  of 
final  predictors  for  each  HADS  outcome).  We  also  looked  for  interactions  among  predictors  by 
running  random  forest  models  for  each  outcome.  Specifically,  we  generated  predicted 
probabilities  for  each  outcome  using  cross-validated  random  forest  models  on  the  final  sets  of 
predictors  for  each  outcome.  To  determine  the  importance  of  any  predictor  interactions,  we 
subsequently  estimated  3  logistic  regression  models  for  all  10  outcomes  using  (i)  the  stepwise 
regression  predicted  probability  alone,  (ii)  the  random  forest  predicted  probability  alone  and  (iii) 
both  the  stepwise  and  random  forest  predicted  probabilities.  We  compared  the  AUCs  and  CRs 
for  these  models  to  determine  the  incremental  predictive  validity  of  the  random  forests  models. 
For  all  outcomes  in  the  HADS  2004-2009,  the  predicted  probabilities  from  the  random  forest 
models  did  not  improve  AUC  or  CR  over  and  above  the  corresponding  stepwise  regression 
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models.  We  eonsequently  exeluded  the  random  forest  predieted  probabilities  from  all 
subsequent  models. 

Next,  we  ran  eross-validated  elastie  net  penalized  regression  models  with  various  mixing 
parameter  penalties  (i.e.,  setting  alpha  =  0.1,  0.3,  0.5,  0.7,  0.9,  and  0.99)  using  the  best  set  of 
predictor  variables  from  the  non-cross-validated  stepwise  regression.  As  mentioned  earlier,  we 
used  elastic  net  as  a  way  of  correcting  for  multicollinearity  among  the  final  sets  of  predictors. 

We  evaluated  the  AUC  and  CR  for  each  specified  mixing  parameter  penalty  and  selected  the  best 
final  elastic  net  model  for  each  outcome.  For  all  10  outcomes,  the  alpha  =  0.5  models  resulted  in 
highest  classification  accuracy  and  CR.  We  subsequently  ran  non-cross-validated  elastic  nets 
with  alpha  =  0.5  and  compared  the  AUC,  CR,  and  size  of  the  coefficients  of  the  best  predictors  to 
the  results  from  the  non-cross-validated  stepwise  regressions.  The  final  results  for  perpetration 
and  victimization  of  non- familial  major  physical  workplace  violence  can  be  found  in  Main  Table 
6  (attached),  whereas  the  final  results  for  perpetration  and  victimization  of  non- familial  major 
sexual  workplace  violence  can  be  found  in  Main  Table  7  (attached). 

In  regard  to  the  classification  accuracy  of  these  final  models,  performance  of  the  non-cross- 
validated  stepwise  logistic  regressions  were  virtually  identical  to  the  performance  of  the  non¬ 
cross-validated  elastic  nets;  both  approaches  resulted  in  AUCs  of  0.7  (for  sexual  violence 
perpetration  among  males),  0.8  (for  major  physical  violence  perpetration  and  victimization 
among  males,  females,  and  in  the  total  sample;  major  sexual  violence  victimization  among  males 
and  females),  or  0.9  (for  major  sexual  violence  victimization  in  the  total  sample).  As  an  AUC  of 
0.5  suggests  no  better  prediction  accuracy  than  would  be  expected  by  chance,  the  AUCs 
achieved  in  our  analyses  collectively  suggests  that  our  final  models  performed  quite  well  at 
predicting  the  10  workplace  violence  outcomes.  CRs  were  also  similar  across  the  final  stepwise 
and  elastic  net  models  for  each  outcome  but  did  vary  slightly  depending  on  the  outcome.  For  all 
major  physical  violence  perpetration  and  major  sexual  violence  victimization  models  (6  of  the  10 
outcomes),  over  30%  of  all  first  occurrences  of  the  outcomes  occurred  among  soldiers  in  the  top 
5%  of  predicted  risk  derived  (33.1-72.9%  of  these  6  outcomes  occurred  among  soldiers  in  the  top 
5%  of  predicted  risk).  In  comparison,  the  prediction  models  for  major  physical  violence 
victimization  and  major  sexual  violence  perpetration  (i.e.,  the  remaining  4  outcomes)  had 
slightly  lower  CRs  (i.e.,  21.9-28.8%  of  these  4  outcomes  occurred  among  soldiers  with  the  top 
5%  of  predicted  risk).  The  strongest  major  physical  violence  model  (Main  Table  6)  was  for 
major  physical  violence  perpetration  in  the  total  sample  -  37.2%  (elastic  net)  to  37.4%  (stepwise) 
of  perpetrations  occurred  among  soldiers  in  the  top  5%  of  predicted  risk.  The  weakest  physical 
violence  model  was  for  major  violence  victimization  in  the  total  sample,  where  24.5%  (elastic 
net)  to  24.7%  of  all  victimizations  occurred  among  soldiers  in  the  top  5%  of  predicted  risk.  The 
strongest  sexual  violence  model  (Main  Table  7)  was  for  major  sexual  violence  victimization  in 
the  total  sample,  resulting  in  by  far  the  highest  CR  (71.7%  [elastic  net]  to  72.9%  [stepwise]  of  all 
victimizations  occurring  among  soldiers  in  the  top  5%  of  predicted  risk).  This  occurred  primarily 
because  female  soldiers  have  much  higher  rates  of  sexual  victimization  than  male  soldiers,  and 
gender  stepped-in  as  one  of  the  best  20  predictors  for  this  outcome  in  the  total  sample.  In 
contrast,  the  weakest  sexual  perpetration/victimization  model  was  for  predicting  major  sexual 
violence  perpetration  among  males;  21.9%  of  all  occurrences  were  perpetrated  by  male  soldiers 
with  the  top  5%  of  predicted  risk. 
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In  order  to  further  interpret  the  performance  of  our  final  models,  we  also  calculated  the 
standardized  rates  of  each  outcome  per  1,000  PY  within  six  discretized  CR  groups  (i.e.,  top  1%, 
2-5%,  5-10%,  10-45%,  45-100%).  Similar  to  the  results  reported  above  for  CR,  these  rates  were 
similar  but  varied  slightly  across  the  final  non-cross-validated  stepwise  logistic  regression  and 
elastic  net  models  for  each  outcome.  The  highest  rates  of  physical  violence 
perpetration/victimization  (Main  Table  6)  were  found  in  the  model  predicting  major  physical 
violence  perpetration  among  male  soldiers;  there  were  29.2  (elastic  net)  to  29.4  (stepwise)  first 
occurrences  of  major  physical  violence  perpetration  per  1,000  PY  among  soldiers  in  the  top  1% 
of  predicted  risk.  In  comparison,  there  were  0.5  first  occurrences  per  1,000  PY  among  male 
soldiers  in  the  bottom  45-100%  of  predicted  risk  (for  both  elastic  net  and  stepwise  models)  and 
3.2  first  occurrences  of  major  physical  violence  perpetration  per  1,000  PY  among  all  male 
soldiers  in  the  sample  (Main  Table  3).  The  highest  rates  of  sexual  violence 
perpetration/victimization  (Main  Table  7)  were  found  in  the  model  predicting  major  sexual 
violence  victimization  among  female  soldiers;  there  were  1 19.3  (elastic  net)  to  120.4  (stepwise 
regression)  first  occurrences  of  sexual  violence  victimization  per  1 ,000  PY  among  female 
soldiers  in  the  top  1%  of  predicted  risk.  In  comparison,  there  were  1.8  first  occurrences  per  1,000 
PY  among  male  soldiers  in  the  bottom  45-100%  of  predicted  risk  (for  both  elastic  net  and 
stepwise  models)  and  22.5  first  occurrences  of  major  physical  violence  perpetration  per  1,000 
PY  among  all  female  soldiers  in  the  sample  (Main  Table  3). 

Caution  is  needed  in  interpreting  the  final  sets  of  20  or  25  predictors  for  each  outcome  as  the  data 
mining  methods  employed  maximized  overall  prediction  accuracy  rather  than  individual 
coefficient  accuracy.  Although  we  do  not  review  the  full  final  predictor  sets  for  each  outcome 
here,  it  is  nonetheless  noteworthy  that  (i)  several  predictors  were  identical  across  outcomes  and 
(ii)  all  four  classes  of  administrative  predictors  (socio-demographic  factors,  military  career 
experiences,  prior  crime  experiences,  and  health/stress)  were  important.  See  Appendix  Tables  1- 
10  for  the  list  of  final  predictors  for  each  model  and  the  associated  ORs  from  the  non-cross- 
validated  stepwise  logistic  regression  and  elastic  net  models.  For  example,  the  final  set  of 
predictors  for  the  4  major  physical/sexual  perpetration  outcomes  (see  Appendix  Tables  1-3  and 
7)  each  included  Non-Hispanic  Black  (stepwise  ORs  =  1.8-3. 7;  elastic  net  ORs  =  1. 4-2.0), 
current  deployment  (stepwise  ORs  =  0.2-0.4;  elastic  net  ORs  =  0. 3-0.4),  being  demoted  in  the 
past  12  months  (stepwise  ORs  =  1.5 -2.1;  elastic  net  ORs  =  1.5 -1.6),  and  perpetrating  any  crime 
in  the  prior  24  months  (stepwise  ORs  =1. 6-2.1;  elastic  net  ORs  =  1. 4-2.0).  Likewise,  all  3  of  the 
final  major  physical  violence  victimization  models  (see  Appendix  Tables  4-6)  included  current 
deployment  (stepwise  ORs  =  0.6;  elastic  net  ORs  =  0.6),  having  a  rank  of  E5  or  higher  (stepwise 
ORs  =  0.6-0. 7;  elastic  net  ORs  =  0.6-0. 7)  or  E7  or  higher  (stepwise  ORs  =  0.4;  elastic  net  ORs  = 
0.4),  having  an  AEQT  score  below  50  (stepwise  ORs  =  1.3- 1.4;  elastic  net  ORs  =  1.3- 1.4), 
perpetrating  any  crime  in  the  prior  12  months  (stepwise  ORs  =  1.2-1. 6;  elastic  net  ORs  =  1.2- 
1.6),  being  the  victim  of  any  crime  in  the  prior  12  months  (stepwise  ORs  =  1. 6-3.0;  elastic  net 
ORs  =  1.6-2. 9),  and  average  percent  of  combat  deaths  over  the  past  year  (stepwise  ORs  =  1.6- 
4.0;  elastic  net  ORs  =  1.6-3. 7).  Eurther,  all  3  of  the  final  major  sexual  violence  victimization 
models  (see  Appendix  Tables  8-10)  included  being  over  23  years  old  (stepwise  ORs  =  0.6; 
elastic  net  ORs  =  0.6-0. 7),  having  a  rank  of  E5  or  higher  (stepwise  ORs  =  0.3-0. 5;  elastic  net 
ORs  =  0.5-0. 6),  having  25+  total  months  in  service  (stepwise  ORs  =  0.6;  elastic  net  ORs  =  0.6), 
total  number  of  days  in  the  past  12  months  with  outpatient  visits  for  a  mental  health  diagnosis 
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(stepwise  ORs  =  1.2-1. 3;  elastic  net  ORs  =  1.1-1. 3),  and  number  of  different  duty  units  over  the 
past  12  months  (stepwise  ORs  =  1.2-1. 3;  elastic  net  ORs  =  1.1-1. 2). 


Specific  Aim  8:  Use  data  mining  methods  to  develop  prediction  equations  for  survey, 
neurocognitive,  genetic,  and  limited  lADF  predictors  of  administratively-recorded  workplace 
violence  perpetration  and  victimization  in  the  NSS.  Cross-validate  final  prediction  equations. 
Develop  a  computer  program  for  Army  leadership  to  generate  risk  scores  for  individual  soldiers 
based  on  lADF  profiles. 

Update:  Main  Table  8  (attached)  shows  the  standardized  rates  per  1,000  PY  for  major  physical, 
minor  physical,  and  major  sexual  workplace  violence  perpetration  and  victimization  in  the  NSS 
2011-2013.  Because  the  vast  majority  of  all  first  occurrences  of  non-familial  major  physical  and 
major  sexual  violent  crimes  were  perpetrated  by  males,  we  restricted  analysis  of  perpetration  of 
these  2  outcomes  to  males  only  (i.e.,  females  accounted  for  only  3  occurrences  of  major  physical 
violence  perpetration  and  3  occurrences  of  sexual  violence  perpetration).  We  restricted  the 
analysis  of  first  victimization  of  major  sexual  violence  to  females  for  the  same  reason.  For 
perpetration  and  victimization  of  minor  physical  crime  we  focused  on  the  total  sample.  In 
comparison,  there  were  too  few  victimizations  of  major  physical  violence  to  produce  stable 
results  and  thus  we  will  need  to  return  to  this  outcome  when  we  have  more  months  of  follow-up. 

Similar  to  the  HADS,  we  built  NSS  person-month  discrete-time  survival  analysis  data  files  to 
predict  first  onsets  of  the  workplace  violence  outcomes  from  temporally  primary  survey  data. 
These  person-month  files  start  at  the  month  of  interview  (in  the  NSS  this  would  be  the  month  of 
arriving  to  Basic  Combat  Training)  and  follow  soldiers  until  December  2013.  As  mentioned 
earlier,  the  interviews  were  conducted  from  April  2011  -  December  2012.  In  other  words,  we 
had  between  12  and  32  months  of  follow-up  for  new  soldiers  depending  on  when  they  completed 
the  NSS  survey.  Before  building  the  person-month  files,  we  constructed  hundreds  of  predictors 
from  the  NSS  survey.  There  are  many  advantages  of  survey  predictors  over  the  administrative 
predictors  used  in  HADS.  For  example,  we  had  validated  self-report  measures  of  lifetime  and 
30-day  mental  disorders  (regardless  of  treatment),  whereas  our  administrative  predictors  of 
mental  disorder  were  based  solely  on  records  of  outpatient  and  inpatient  treatment  visits  (the 
HADS  could  not  be  used  to  operationalize  mental  disorders  among  soldiers  who  did  not  have 
such  visits).  In  addition,  the  surveys  have  a  much  richer  array  of  predictors,  including  childhood 
adversities,  traumatic  events,  stressors,  lifetime  and  30-day  mental  disorders,  personality  scales, 
and  social  network  information.  However,  a  significant  limitation  of  the  current  NSS  sample  is 
that  we  have  thus  far  only  been  able  to  follow  soldiers  for  12-32  months  (through  December 
2013).  We  therefore  have  only  a  small  number  of  outcomes  to  predict  using  the  survey  data.  For 
instance,  the  sample  for  perpetration  of  major  physical  violence  in  the  NSS  had  366,263  person- 
months  and  only  56  first  occurrences  of  major  physical  violence.  In  comparison,  we  had  over  30 
million  person-months  and  5,300  first  occurrences  of  major  physical  violence  perpetration  in  the 
2004-2009  HADS  models.  As  such,  we  have  many  fewer  significant  predictors  even  though  the 
predictors  are  substantively  richer  than  those  in  HADS. 

NSS  bivariate  models.  Consistent  with  our  approach  in  the  HADS,  we  began  by  examining 
bivariate  associations  between  each  of  the  survey  predictors  with  the  5  NSS  outcomes:  major 
physical  perpetration  and  major  sexual  perpetration  among  males,  major  sexual  victimization 
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among  females,  and  minor  physieal  perpetration  and  vietimization  in  the  total  sample.  Here  we 
review  some,  but  not  all,  of  the  signifieant  bivariate  assoeiations.  Most  signifieant  assoeiations 
were  eonsistent  with  study  hypotheses.  Tables  for  these  bivariate  results  are  also  available  upon 
request. 

The  strongest  bivariate  assoeiations  with  first  perpetration  of  major  physical  violence  among 
males  included  positive  associations  with  being  Non-Hispanic  Black,  having  any  medical  failure 
at  accession,  having  had  4  or  more  years  with  anger  attacks,  1  or  more  years  with  problem  of 
conduct/behavior,  3  or  more  years  with  alcohol  or  drug  problems,  having  5  or  more  sexual 
partners  in  the  past  12  months,  and  having  severe  or  very  severe  stress  in  the  past  12  months. 
Perpetration  of  major  physical  violence  among  males  also  had  inverse  associations  with  being 
beyond  the  first  year  of  service,  current  symptoms  of  a  major  depressive  episode,  number  of 
years  with  depression  and  anxiety,  number  of  lifetime  panic  attacks,  being  previously  bullied, 
having  one  or  more  parents  with  a  history  of  depression  or  anxiety,  and  having  an  introverted 
personality.  Perpetration  of  sexual  violence  among  males  had  significant  positive  bivariate 
associations  with  being  Non-Hispanic  Black  and  Hispanic,  El  rank,  having  an  AFQT  scores 
below  the  median,  a  lifetime  history  of  oppositional  defiant  disorder,  being  sent  to  a  juvenile 
detention  center  as  a  child,  being  physically  abused  as  a  child,  and  religiosity,  and  significant 
inverse  associations  with  levels  of  introversion,  parent  history  of  mania,  having  relatives  with 
drinking  problems,  and  having  a  history  of  self-harm  behaviors.  Perpetration  of  minor  physical 
violence  in  the  total  sample  was  associated  with  many  of  the  same  predictors  described  above,  in 
addition  to  several  other  predictors  (due  to  their  being  many  more  cases  of  minor  violence 
perpetration  compared  to  the  other  outcomes).  Significant  positive  associations  with  minor 
physical  violence  perpetration  were  found  for  being  Non-Hispanic  Black,  currently  or  previously 
married,  having  a  lifetime  history  of  intermittent  explosive  disorder,  substance  use  disorders,  and 
conduct  disorder,  total  count  of  lifetime  disorders,  number  of  years  with  insomnia,  number  of 
years  with  anger  attacks,  number  of  years  with  problem  conduct/behaviors,  being  previously 
physically  assaulted,  being  in  a  foster  home  or  juvenile  detention  as  a  child,  anger/irritability, 
antisocial  personality,  social  anhedonia,  and  having  severe  or  very  severe  stress  in  the  past  12 
months.  Minor  physical  violence  perpetration  had  significant  inverse  associations  with  soldier’s 
education,  parent  education,  and  introverted  personality. 

Major  sexual  violence  victimization  among  females  was  significantly  inversely  related  to  rank, 
age  at  accession,  AFQT  score,  being  Non-Hispanic  Black,  Christian,  and  having  one  or  more 
dependents.  Several  more  variables  were  positively  associated  with  this  outcome;  having  a 
lifetime  PTSD  or  oppositional  defiant  disorder  diagnosis,  number  of  lifetime  physical  and  sexual 
assaults,  total  number  of  lifetime  traumatic  events,  being  bullied  ten  or  more  times  in  one’s  life, 
parental  history  of  psychopathology  and  violence,  being  sexually  and  physically  abused  as  a 
child,  sexual  harassment  as  a  child,  living  in  a  foster  home,  the  total  number  of  childhood 
adversities,  social  anhedonia,  affectivity  lability,  sensation  seeking  traits,  impulsivity,  lifetime 
suicide  ideation,  and  number  of  lifetime  occurrences  of  self-harm  behaviors.  In  regard  to  minor 
physical  violence  victimization  in  the  total  sample,  positive  associations  were  found  for:  being 
female,  being  bom  in  the  U.S.,  having  an  El  rank,  having  a  lifetime  history  of  mania,  total 
number  of  lifetime  mental  disorders,  having  1 1  or  more  lifetime  panic  attacks,  6  or  more  years 
with  depression,  number  of  lifetime  sexual  assaults,  total  number  of  lifetime  traumas,  several 
childhood  adversities  (e.g.,  being  in  foster  care,  childhood  sexual  assault,  childhood  sexual 
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harassment),  having  2  or  more  sexual  partners  in  the  past  12  months,  and  having  severe  or  very 
severe  stress  in  the  past  12  months.  Inverse  assoeiations  with  minor  physieal  violenee 
vietimization  were  found  for  AFQT  seores  and  personality  traits  of  agreeableness,  impulsivity, 
and  dispositional  optimism. 

Using  data  mining  to  develop  the  NSS  risk  equations.  After  evaluating  the  bivariate 
assoeiations  deseribed  above,  we  applied  the  same  data  mining  techniques  described  in  Specific 
Aim  7  (i.e.,  cross-validated  stepwise  logistic  regression  to  identify  the  final  set  of  predictors, 
non-cross-validated  stepwise  regression  and  elastic  net  models  to  determine  final  model 
performance  for  each  outcome),  with  the  only  difference  being  that  the  predictors  were  from  the 
NSS  survey  rather  than  from  HADS  administrative  data.  Thus,  we  do  not  re-report  the  order  of 
operations  in  detail  here.  There  was,  however,  one  noteworthy  difference  in  the  NSS  data  mining 
analyses.  A  number  of  NSS  predictors  had  to  be  excluded  from  the  models  predicting  major 
physical  and  major  sexual  violence  perpetration  because  the  low  rates  of  these  outcomes  resulted 
in  highly  unstable  coefficients.  As  a  result,  fewer  predictors  (compared  to  the  HADS  models) 
stepped  in  to  the  cross-validated  stepwise  regression  models  that  were  used  to  determine  the  final 
set  of  predictors  for  each  of  the  five  outcomes  (e.g.,  unrestricted  models  started  with  between  38 
and  132  predictors,  with  a  maximum  of  8  to  22  unrestricted  predictors  ultimately  stepping  in). 

AUCs  and  CR  from  the  cross-validated  stepwise  regressions  results  suggest  that  the  models 
restricted  to  allow  5  (i.e.,  for  perpetration  of  major  physical  violence  among  males  and  major 
sexual  violence  victimization  among  females),  10  (i.e.,  for  perpetration  of  major  sexual  violence 
among  males  and  minor  violence  victimization  in  the  total  sample),  or  15  (i.e.,  for  minor 
violence  perpetration  in  the  total  sample)  predictors  had  the  highest  classification  accuracy 
(AUCs  =  0.7-0. 8)  and  the  largest  proportions  of  the  outcomes  occurring  among  soldiers  with  the 
top  5%  of  predicted  risk  (CRs  =  12.0%  to  26.4%).  After  selecting  the  best  cross-validated 
stepwise  regression  for  each  outcome,  we  then  ran  non-cross-validated  models  restricting  to  the 
optimal  number  of  predictors  (i.e.,  5,  10,  or  15)  to  obtain  ORs  (see  Appendix  Tables  11-15  for 
the  list  of  final  predictors  for  each  NSS  outcome).  We  also  ran  cross-validated  random  forests 
models  for  each  outcome  in  effort  to  identify  interactions  among  the  predictors,  but  similar  to  the 
HADS  models  the  random  forests  predicted  probabilities  did  not  meaningfully  improve  AUC  or 
CR  and  were  thus  excluded  from  subsequent  models. 

The  final  predictor  sets  for  each  outcome  were  then  evaluated  using  cross-validated  elastic  net 
models  using  various  mixing  parameter  penalties  to  correct  for  multicollinearity  among 
predictors  (i.e.,  setting  alpha  =  alpha  =  0.1,  0.3,  0.5,  0.7,  0.9,  and  0.99).  Similar  to  the  10  HADS 
models,  the  5  NSS  outcomes  performed  best  at  an  alpha  of  0.5.  We  then  ran  non-cross-validated 
elastic  nets  with  alpha  =  0.5  and  compared  the  AUC,  CR,  and  size  of  the  coefficients  of  the  best 
predictors  to  the  non-cross-validated  stepwise  logistic  regression  models.  The  final  performance 
of  these  final  models  can  be  found  in  Main  Table  9  (attached).  For  all  5  outcomes,  performance 
of  the  stepwise  regression  and  elastic  net  models  were  virtually  identical  in  regard  to  overall 
classification  accuracy  (stepwise  regression  AUCs  =  0.7  to  0.8;  elastic  net  AUCs  =  0.7  to  0.8). 
Although  CRs  were  also  similar  across  stepwise  regression  and  elastic  net  models,  it  is 
noteworthy  that  the  elastic  net  performed  somewhat  better  at  predicting  major  sexual  violence 
perpetration  among  males  (stepwise  CR  =  20.6%  versus  elastic  net  CR  =  26.4%  of  all  first 
perpetrations  of  sexual  violence  occurring  among  male  soldiers  in  the  top  5%  of  predicted  risk). 


15 


The  stepwise  regression  and  elastie  net  models  predicting  perpetration  of  major  physical  violence 
among  males  had  the  highest  CR  in  the  top  5%  (28.3%  [stepwise  regression]  to  30.2%  [elastic 
net]  of  all  first  occurrences  of  major  physical  violence  perpetration  occurring  among  male 
soldiers  in  the  top  5%  of  predicted  risk),  whereas  the  models  for  minor  physical  violence 
victimization  in  the  total  sample  had  the  lowest  CR  in  the  top  5%  (18.4%  [for  both  stepwise 
regression  and  elastic  net]  of  first  victimizations  of  minor  physical  violence  occurring  among 
those  in  the  top  5%  of  predicted  risk).  In  comparison  to  the  HADS  models  described  in  Specific 
Aim  7,  the  NSS  models  had  similar  AUCs  but  consistently  lower  CRs.  This  is  not  surprising  as 
the  NSS  models  used  (i)  much  smaller  samples  (i.e.,  fewer  occurrences  of  the  defined  outcomes), 
and  (ii)  fewer  predictors  (i.e.,  some  predictors  had  to  be  omitted  from  analysis  due  to  coefficients 
instability).  Importantly,  we  expect  the  performance  of  these  NSS  models  to  improve  once  we 
obtained  an  additional  12-18  months  of  follow-up  outcome  data. 

We  also  calculated  the  standardized  rates  of  each  outcome  per  1,000  PY  within  each  of  the 
discretized  CR  groups  (top  1%,  2-5%,  5-10%,  10-45%,  45-100%).  The  highest  rates  of  physical 
violence  perpetration/victimization  were  found  in  the  models  predicting  minor  physical  violence 
perpetration  in  the  total  sample;  there  were  57.2  (stepwise  regression)  to  62.1  (elastic  net)  first 
occurrences  of  minor  physical  violence  perpetration  per  1,000  PY  among  new  soldiers  in  the  top 
1%  of  predicted  risk  (Main  Table  9).  In  comparison,  there  were  2.5  (stepwise  regression)  to  2.9 
(elastic  net)  first  occurrences  per  1,000  PY  among  all  soldiers  in  the  bottom  45-100%  of 
predicted  risk  and  6.3  first  occurrences  of  minor  physical  violence  perpetration  per  1,000  PY 
among  all  new  soldiers  in  the  sample  (Main  Table  8).  The  highest  rates  of  sexual  violence 
perpetration/victimization  were  found  in  the  model  predicting  major  sexual  violence 
victimization  among  new  female  soldiers;  there  were  1 12.6  (elastic  net)  to  157.3  (stepwise 
regression)  first  occurrences  of  sexual  violence  victimization  per  1 ,000  PY  among  female 
soldiers  in  the  top  1%  of  predicted  risk.  In  comparison,  there  were  13.6  (elastic  net)  to  14.4 
(stepwise  regression)  first  occurrences  per  1,000  PY  among  all  soldiers  in  the  bottom  45-100% 
of  predicted  risk  and  22.0  first  occurrences  of  minor  physical  violence  perpetration  per  1,000  PY 
among  all  new  soldiers  in  the  sample  (Main  Table  8). 

As  mentioned  is  Specific  Aim  7,  although  caution  is  needed  in  interpreting  the  final  set  of 
predictors  for  each  outcome,  it  is  noteworthy  that  some  predictors  were  identical  across  NSS 
perpetration  outcomes  (see  Appendix  Tables  11-15  for  the  list  of  final  predictors  for  each  model 
and  the  associated  ORs  from  the  non-cross  validated  stepwise  regression  and  elastic  net  models). 
For  example,  the  final  set  of  predictors  for  the  3  perpetration  outcomes  each  included  predictors 
for  Non-Hispanic  Black  race  (stepwise  ORs  =  2. 0-5.0;  elastic  net  ORs  =  2. 0-4. 3)  and  introverted 
personality  (stepwise  ORs  =  0.8-0. 9;  elastic  net  ORs  =  0. 8-1.0).  Notably,  the  predictor  for  Non- 
Hispanic  Black  race  was  also  included  as  one  of  the  final  predictors  in  all  6  of  the  HADS 
perpetration  models.  In  comparison,  although  there  was  no  exact  overlap  among  the  final  set  of 
predictors  for  the  2  NSS  victimization  models,  variables  representing  certain  personality  traits 
(e.g.,  thoughtfulness,  impulsivity)  and  past  trauma/childhood  adversities  were  consistently 
included  among  the  final  set  of  predictors  for  both  models  (see  Appendix  Tables  14-15). 

Specific  Aim  9:  Use  data  mining  methods  to  develop  prediction  equations  for  retrospective 
lADF  and  self-report  survey  predictors  of  administratively-recorded  workplace  violence 
perpetration-victimization  in  the  AAS  and  PPDS  datasets.  Cross-validate  final  prediction 
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equations.  Develop  a  eomputer  program  for  Army  leadership  to  generate  risk  seores  for 
individual  Soldiers  based  on  lADF  profdes. 

Update:  As  noted  above,  in  Year  2  we  foeused  on  analysis  of  the  eonsenting  soldier 
administrative  data  system  reports  of  workplaee  violenee  vietimization  in  the  NSS  and  in  the 
2004-2009  HADS.  No  progress  has  been  made  on  this  aim. 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  We  generated  a  eoding  seheme  to  elassify  offense  types  and  identify  whieh  offenses  should 
be  eategorized  as  workplaee  violenee  by  reviewing  a  number  of  alternative  elassifieation 
sehemes.  We  finally  settled  on  the  Bureau  of  Justiee  Statisties  National  Correetions 
Reporting  Program  (NCRP)  olassifieation  system  (United  States  Department  of  Justiee).  We 
eategorized  offense  types  into  violent  and  non-violent  erimes  and  also  distinguished  between 
offense  types  related  to  family  versus  erimes  not  related  to  family.  All  erimes  that  are  both 
violent  and  not  related  to  family  are  elassed  as  “workplaee  violenee,”  as  these  are  the  violent 
erimes  that  were  eommitted  by  Regular  Army  personnel  while  on  aetive  duty.  We  used  this 
eoding  seheme  to  elassify  all  of  the  reported  oeeurrenees  of  eriminal  activity  in  the  following 
administrative  datasets:  Centralized  Operations  Police  Suite  /  Military  Police  Reporting 
System  (COPS/MPRS),  Criminal  Investigation  Division  Information  Management  System  / 
Automated  Criminal  Investigation/Criminal  Intelligence  (CIMS/ACI2),  Criminal 
Investigation  Division  Information  Management  System  /  Automated  System  Crime  Record 
Center  (CIMS/ASCRC),  Army  Court  Martial  Information  System  (ACMIS),  Sexual  Assault 
Data  Management  System  (SADMS). 

•  Samples  were  drawn  to  analyze  longitudinal  profiles  of  first  occurrence  and  recurrence  of 
administratively-recorded  perpetration  and  victimization  of  workplace  (non- familial)  major 
physical  (murder,  homicide,  manslaughter,  aggravated  arson,  aggravated  assault, 
kidnapping,  and  robbery),  minor  physical  (simple  assault  and  verbal  violence  such  as 
harassment,  blackmail,  extortion  and  intimidation),  and  major  sexual  violence  (rape,  sodomy 
and  sexual  assault)  in  the  HADS  2004-2009. 

•  We  created  administrative  recorded  predictors  of  workplace  violence.  The  HADS  was 
created  by  merging  data  from  38  different  Army  and  DoD  data  systems  for  each  soldier  on 
active  duty  in  the  Army  over  the  study  period  (we  had  information  on  predictors  from  2000- 
2009).  Separate  observational  records  were  created  for  each  of  the  over  37  million  person- 
months  of  active  duty  service  of  the  over  975,000  Regular  Army  soldiers  in  the  HADS,  each 
record  containing  complete  historical  administrative  information  for  the  soldier  in  question. 
We  coded  5  sets  of  administrative  predictors  from  this  data  including  socio-demographics, 
career  history  variables,  prior  criminal  activity,  health  (mental  and  physical)  and  stress,  and 
unit-level  variables 

•  Using  the  same  coding  scheme  developed  in  bullet  point  #1,  we  classified  offenses  from  the 
NSS  consenting  soldier  administrative  data  2011-2013.  We  generated  a  person-month  data 
files  starting  with  the  month  of  interview  and  following  the  soldier  prospectively  from  12-32 
months  coding  each  occurrence  of  major  physical,  minor  physical  and  major  sexual 
workplace  violence  across  the  time  period  (soldiers  interviewed  in  April  2011  were  followed 
for  32  months  while  soldiers  interviewed  in  December  2012  were  followed  12  months). 
Separate  observational  records  were  created  for  each  of  the  400,000+  person-months  of 
active  duty  service  among  20,000+  Regular  Army  soldiers  in  the  NSS. 
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•  We  developed  10  sets  of  survey  predictor  and  limited  administrative  predictor  variables  for 
the  NSS  samples  including:  socio-demographic,  accession  variables,  self-reported  mental 
disorders,  traumatic  events,  childhood  adversities,  personality  scales,  social  support 
variables,  self-reported  suicidal  behavior  (including  ideation,  plan,  attempt,  and  self-harm 
behaviors),  injuries,  and  self-reported  treatment  for  mental  disorders. 

•  Discrete-time  survival  analysis  with  time-varying  covariates  was  used  to  study  bivariate 
predictors  of  workplace  violence  in  both  the  HADS  2004-2009  and  the  NSS  consenting 
soldier  administrative  data  2011-2013. 

•  A  state-of-the-art  ensembling  machine  learning  methodology  was  used  to  develop  optimal 
risk  prediction  equations  identifying  soldiers  at  highest  risk  of  perpetration/victimization  of 
workplace  violence  in  both  the  HADS  2004-2009  and  the  NSS  2011-2013.  Thus  far  we  have 
developed  preliminary  algorithms  for  10  outcomes  in  the  HADS  and  5  outcomes  in  the  NSS. 

•  We  are  working  with  the  Army  Analytics  Group  (AAG)  to  create  a  continuously-updating 
version  of  the  HADS  for  real-time  implementation  in  targeting  and  matching. 

CONCLUSION 

In  summary,  we  have  made  substantial  progress  on  many  of  our  specific  aims  over  the  past  year. 
First,  we  have  produced  final  standardized  rates  (per  1,000  PY)  for  several  different  forms  of 
workplace  violence  perpetration  and  victimization  (e.g..  Main  Tables  1-4;  Main  Table  8). 
Collectively,  these  rates  confirm  that  a  high  proportion  of  soldiers  in  the  HADS,  NSS,  AAS,  and 
PPDS  samples  have  perpetrated  and/or  been  a  victim  of  workplace  violence.  In  general,  male 
soldiers  had  higher  rates  perpetrating  workplace  violence  (particularly  major  physical  and  major 
sexual  violence),  while  female  soldiers  had  higher  rates  of  being  victims  of  violence  (particularly 
minor  physical  violence  and  major  sexual  violence).  These  findings  underscore  the  overall 
importance  of  our  research.  Second,  we  have  developed  several  preliminary  risk  algorithms  to 
predict  the  first  occurrence  of  workplace  violence  perpetration  and  victimization  in  the  HADS 
and  NSS.  The  data  mining  methods  used  to  develop  the  risk  equations  are  cutting-edge.  The 
models  are  useful  in  that  that  can  identify  soldiers  at  the  highest  risk  of  workplace  violence 
perpetration  and  victimization  with  accuracy  much  better  than  would  be  expected  by  chance  (i.e., 
based  on  AUCs).  Soldiers  determined  to  be  at  highest  risk  (i.e.,  in  the  top  5%  of  predicted  risk) 
also  accounted  for  a  large  proportion  of  first  occurrences  of  workplace  violence  perpetration  and 
victimization.  The  coefficients  from  the  bivariate  analyses  and  final  (i.e.,  best  model)  predictor 
sets  corresponding  to  each  outcome  provide  important  information  about  specific  risk  and 
resilience  factors  of  workplace  violence  perpetration  and  victimization  that  may  eventually  be 
integrated  into  targeted  preventative  interventions.  The  ultimate  clinical  utility  of  our  risk 
algorithms  is  in  their  ability  to  identify  soldiers  with  high  perpetration/victimization  risk.  As  it  is 
both  infeasible  and  impractical  to  offer  preventive  interventions  for  workplace  violence 
perpetration  and  victimization  to  all  soldiers,  our  algorithms  can  be  used  to  prioritize  who  should 
be  offered  such  programs  (e.g.,  requiring  soldiers  in  the  top  1%  or  top  5%  of  predicted  risk  to 
attend  perpetration  or  victimization  prevention  programs). 

Moving  forward,  we  have  several  plans  to  allow  us  to  further  accomplish  our  project  goals  and 
objectives.  First,  we  are  currently  in  the  processes  of  conducting  analyses  in  the  HADS  to 
determine  if  we  can  identify  theoretically-guided  high  risk  “subtypes”  of  soldiers  who  perpetrate 
or  are  the  victim  of  violence.  Specifically,  we  are  using  cluster  analysis  to  see  if  we  can  find 
soldiers  who  share  similar  characteristics  (i.e.,  predictors)  and  may  thus  benefit  from  preventive 
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interventions  tailored  to  those  specific  characteristics.  Second,  we  plan  to  develop  parallel  risk 
equations  that  predict  the  recurrence  of  perpetration  and  victimization  of  workplace  violence  in 
the  HADS.  Third,  we  plan  to  re-estimate  our  data  mining  models  in  the  NSS  as  we  obtain 
additional  (i.e.,  longer)  follow-up  outcome  data.  This  will  allow  us  to  develop  additional  NSS 
perpetration  and  victimization  risk  algorithms  (e.g.,  for  major  physical  violence  victimization)  as 
well  as  likely  improve  the  performance  of  the  models  presented  in  this  report.  Fourth,  we  will 
merge  administrative  perpetration/victimization  outcome  data  with  the  AAS  and  PPDS  survey 
data  in  order  to  complete  Specific  Aims  5  and  6.  Fifth,  we  plan  to  use  the  data  mining  methods 
employed  in  Year  2  to  create  parallel  perpetration-victimization  risk  equations  in  the  PPDS  and 
AAS  (i.e.,  for  both  first  occurrences  and  reoccurrences  of  workplace  violence 
perpetration/victimization).  As  the  AAS  and  PPDS  surveys  include  detailed  self-report 
assessments  of  experiences  occurring  during  prior  deployments  (compared  to  the  coarse 
deployment-related  predictors  in  the  HADS),  analysis  of  these  data  will  allow  us  to  answer 
important  questions  about  the  bivariate  and  multivariate  importance  of  deployment  experiences 
in  predicting  workplace  violence  perpetration  and  victimization.  Sixth,  we  will  continue  to  work 
with  the  AAG  to  create  a  continuously-updating  version  of  the  HADS  algorithms  for  real-time 
implementation  of  our  risk  equations.  We  will  also  continue  to  have  discussions  with  the  Army 
to  determine  the  best  methods  of  implementing  our  survey-based  algorithms. 

PUBLICATIONS,  ABSTRACTS,  AND  PRESENTATIONS 

We  have  prioritized  the  publication  of  several  of  the  HADS  data  mining  prediction  equations. 

We  currently  have  two  manuscripts  that  are  nearly  ready  for  submission,  one  focusing  on  major 
physical  violence  perpetration  (among  men,  women,  and  in  the  total  sample),  and  a  second 
focusing  on  major  sexual  violence  victimization  (among  men,  women,  and  in  the  total  sample). 
We  plan  on  submitting  these  manuscripts  within  the  next  few  months.  We  otherwise  have  no 
publications,  abstracts,  or  presentations  to  report. 

INVENTIONS,  PATENTS  AND  LICENSES 

We  have  no  inventions,  patents,  or  licenses  to  report. 

REPORTABLE  OUTCOMES 

•  We  have  generated  the  rates  per  1,000  PY  for  perpetration  and  victimization  of  major  and 
minor  physical  and  major  sexual  workplace  violence  in  the  2004-2009  HADS  (all  Regular 
Army  soldiers  on  active  duty  during  this  period)  and  in  the  2011-2013  NSS  (new  soldiers). 

•  We  have  generated  the  rates  per  1,000  PY  of  self-reported  major  physical  violence,  minor 
physical  violence,  and  major  sexual  violence  in  the  consolidated  A-STARRS  AAS  and  PPDS 
survey  samples 

•  We  have  examined  numerous  bivariate  associations  of  predictors  (e.g.,  socio-demographics, 
career  history  variables,  mental  and  physical  health  constructs,  stress,  prior 
perpetration/victimization,  childhood  adversities,  traumatic  events,  social  networks)  with  the 
administrative  record  defined  perpetration/victimization  outcomes  in  the  HADS  and  NSS. 

•  We  have  examined  socio-demographic  and  career  history  bivariate  associations  with  self- 
reported  violence  in  the  AAS  and  PPDS. 

•  We  used  cutting-edge  data  mining  methods  to  develop  preliminary  risk  equations  in  the 
HADS  and  NSS.  These  predictive  models  can  identify  high  risk  soldiers  that  should  be 
targeted  with  prevention  and  intervention  programs. 
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•  Final  models  to  identify  high  risk  soldiers  are  in  progress,  and  we  have  started  working  with 
AAG  to  discuss  continuously-updated  versions  our  risk  equations  for  real-time 
implementation  in  identifying  soldiers  who  may  benefit  from  workplace  violence 
preventative  interventions. 

OTHER  ACHIEVEMENTS 

We  have  no  other  achievements  to  report. 
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APPENDICES 

•  Appendix  A:  Main  Tables.  Main  Tables  1-9  that  correspond  to  the  main  results  presented  in 
our  Overall  Project  Summary  (e.g.,  standardized  rates  of  workplace  violence,  AUCs,  CRs; 
labeled  Tables  1-9). 

•  Appendix  B:  Appendix  Tables.  Appendix  Tables  1-15  that  list  out  the  predictors  and  their 
coefficients  from  the  final  models  for  each  of  the  10  HADS  and  5  NSS  workplace  violence 
outcomes. 

•  Appendix  C:  Predicting  Suicides  After  Psychiatric  Hospitalization  in  US  Army  Soldiers. 
Example  Army  STARRS  paper  that  used  similar  data  mining  methods  to  develop  a  risk 
equation  for  predicting  post-inpatient  hospitalization  suicide  (this  paper  is  currently  in  press 
in  JAMA  Psychiatry). 

•  Appendix  D:  Quad  Chart. 
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Appendix  A:  Main  Tables 
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Main  Table  1.  Rates  per  1,000  person-years  of  workplace  (non-familial)  violence  perpetration  and  victimization  in  the  Historical  Administrative  Data  System  (HADS)  2004-2009  Regular  Army 


(n=975,051) 

Major  Physical 

Minor  Physical 

Major 

Sexual 

Minor 

Sexual 

Murder/ 

Homicide/ 

Aggravated  Aggravated 

Any 

Major 

Simple 

Blackmail/ 

Extortion/ 

Other 

Physical 

Any 

Minor 

Rape/ 

Sodomy/ 

Sexual 

Sexual 

Manslaughter  Kidnapping _ Arson _ Assault  Robbery  Physical  Assault  Intimidation  Rioting  Harassment  Violence  Physical  Assault  Verbal 

Total 


Perpetrators 


Accused 

0.2 

0.2 

0.0 

1.6 

0.2 

2.1 

9.3 

1.2 

0.0 

0.6 

0.2 

10.6 

2.8 

0.3 

Founded 

0.2 

0.2 

0.0 

1.6 

0.1 

2.0 

9.1 

1.1 

0.0 

0.6 

0.2 

10.4 

2.3 

0.3 

Judicial  Guilty 

0.1 

0.0 

0.0 

0.2 

0.0 

0.3 

0.1 

0.1 

0.0 

0.0 

0.0 

0.1 

0.4 

0.0 

NJP  guilty 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.1 

0.2 

0.0 

AAT 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.1 

0.0 

Any  Guilty 

0.1 

0.0 

0.0 

0.2 

0.0 

0.3 

0.1 

0.1 

0.0 

0.0 

0.1 

0.3 

0.7 

0.0 

Victims 

0.2 

0.1 

0.0 

1.0 

0.1 

1.4 

4.5 

0.8 

0.0 

0.3 

0.2 

5.5 

1.8 

0.1 

Males 

Perpetrators 

Accused 

0.2 

0.2 

0.0 

1.7 

0.2 

2.2 

9.8 

1.3 

0.0 

0.7 

0.2 

11.2 

3.3 

0.4 

Founded 

0.2 

0.2 

0.0 

1.7 

0.2 

2.2 

9.5 

1.2 

0.0 

0.6 

0.2 

11.0 

2.6 

0.3 

Judicial  Guilty 

0.1 

0.0 

0.0 

0.2 

0.0 

0.3 

0.1 

0.1 

0.0 

0.0 

0.0 

0.2 

0.4 

0.0 

NJP  guilty 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.1 

0.2 

0.0 

AAT 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.1 

0.1 

0.0 

Any  Guilty 

0.1 

0.0 

0.0 

0.3 

0.0 

0.4 

0.2 

0.1 

0.0 

0.0 

0.1 

0.3 

0.8 

0.1 

Victims 

0.2 

0.0 

0.0 

1.1 

0.1 

1.4 

4.0 

0.7 

0.0 

0.3 

0.1 

4.8 

0.3 

0.1 

Females 

Perpetrators 

Accused 

0.1 

0.0 

0.0 

0.9 

0.0 

1.0 

6.3 

0.5 

0.0 

0.3 

0.0 

6.9 

0.3 

0.1 

Founded 

0.1 

0.0 

0.0 

0.9 

0.0 

1.0 

6.2 

0.5 

0.0 

0.3 

0.0 

6.8 

0.3 

0.1 

Judicial  Guilty 

0.0 

0.0 

0.0 

0.1 

0.0 

0.1 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

NJP  guilty 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

AAT 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

Any  Guilty 

0.0 

0.0 

0.0 

0.1 

0.0 

0.1 

0.0 

0.0 

0.0 

0.0 

0.0 

0.1 

0.0 

0.0 

Victims 

0.2 

0.3 

0.0 

1.0 

0.1 

1.4 

7.6 

1.5 

0.0 

0.7 

0.9 

10.0 

11.3 

0.6 

Abbreviations:  NJP,  nonjudicial  punishment  (article  15);  AAT,  administrative  action  taken. 
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Main  Table  2.  Distributions  of  number  of  workplace  (non-familial)  violence  perpetrations  and  victimizations  in  the  Historical  Administrative  Data 
System  (HADS)  2004-2009  Regular  Army  (n=975, 051) 


Total 

Perpetrator  major  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Perpetrator  minor  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Perpetrator  major  sexual  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Victim  major  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Victim  minor  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Victim  major  sexual  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Males 

Perpetrator  major  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 
%  of  acts 

Perpetrator  minor  physical  violence 
Number  of  people 
%  of  people 
Number  of  acts 


Number  of  founded  perpetrations/victimizations  of  each  crime  type  in  the  HADS  2004-2009 


1 

2 

3 

4 

5 

6+ 

Total 

5174 

575 

82 

36 

9 

5 

5881 

0.5% 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.6% 

5174 

1150 

246 

144 

45 

41 

6800 

76.1% 

16.9% 

3.6% 

2.1% 

0.7% 

0.6% 

100.0% 

23480 

3828 

813 

221 

63 

36 

28441 

2.4% 

0.4% 

0.1% 

0.0% 

0.0% 

0.0% 

2.9% 

23480 

7656 

2439 

884 

315 

243 

35017 

67.1% 

21.9% 

7.0% 

2.5% 

0.9% 

0.7% 

100.0% 

4917 

837 

222 

74 

25 

13 

6088 

0.5% 

0.09 

0.023 

0.008 

0.003 

0.0013 

0.6253 

4917 

1674 

666 

296 

125 

96 

7774 

63.2% 

21.5% 

8.6% 

3.8% 

1.6% 

1 .2% 

100.0% 

4010 

251 

22 

0 

0 

0 

4283 

0.4% 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.4% 

4010 

502 

66 

0 

0 

0 

4578 

87.6% 

11.0% 

1 .4% 

0.0% 

0.0% 

0.0% 

100.0% 

14644 

1595 

229 

43 

8 

6 

16525 

1.5% 

0.2% 

0.0% 

0.0% 

0.0% 

0.0% 

1.7% 

14644 

3190 

687 

172 

40 

38 

18771 

78.0% 

17.0% 

3.7% 

0.9% 

0.2% 

0.2% 

100.0% 

4380 

733 

118 

26 

3 

2 

5262 

0.4% 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.5% 

4380 

1466 

354 

104 

15 

14 

6333 

69.2% 

23.1% 

5.6% 

1.6% 

0.2% 

0.2% 

100.0% 

4785 

553 

76 

35 

9 

5 

5463 

0.5% 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.6% 

4785 

1106 

228 

140 

45 

41 

6345 

75.4% 

17.4% 

3.6% 

2.2% 

0.7% 

0.6% 

100.0% 

20972 

3559 

772 

210 

61 

35 

25609 

2.2% 

0.4% 

0.1% 

0.0% 

0.0% 

0.0% 

2.6% 

20972 

7118 

2316 

840 

305 

236 

31787 

66.0% 

22.4% 

7.3% 

2.6% 

1.0% 

0.7% 

100.0% 

23 


%  of  acts 


Perpetrator  major  sexual  violence 


Number  of  people 

4819 

%  of  people 

0.5% 

Number  of  acts 

4819 

%  of  acts 

62.9% 

Victim  major  physical  violence 

Number  of  people 

3431 

%  of  people 

0.4% 

Number  of  acts 

3431 

%  of  acts 

88.4% 

Victim  minor  physical  violence 

Number  of  people 

11161 

%  of  people 

1.1% 

Number  of  acts 

11161 

%  of  acts 

80.7% 

Victim  major  sexual  violence 

Number  of  people 

653 

%  of  people 

0.1% 

Number  of  acts 

653 

%  of  acts 

72.6% 

Females 

Perpetrator  major  physical  violence 

Number  of  people 

389 

%  of  people 

0.0% 

Number  of  acts 

389 

%  of  acts 

85.5% 

Perpetrator  minor  physical  violence 

Number  of  people 

2508 

%  of  people 

0.3% 

Number  of  acts 

2508 

%  of  acts 

77.6% 

Perpetrator  major  sexual  violence 

Number  of  people 

98 

%  of  people 

0.0% 

Number  of  acts 

98 

%  of  acts 

84.5% 

Victim  major  physical  violence 

Number  of  people 

579 

%  of  people 

0.1% 

Number  of  acts 

579 

%  of  acts 

83.0% 

Victim  minor  physical  violence 

Number  of  people 

3483 

%  of  people 

0.4% 

Number  of  acts 

3483 

828 

222 

74 

25 

13 

5981 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.6% 

1656 

666 

296 

125 

96 

7658 

21.6% 

8.7% 

3.9% 

1.6% 

1.3% 

100.0% 

199 

17 

0 

0 

0 

3647 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.4% 

398 

51 

0 

0 

0 

3880 

10.3% 

1.3% 

0.0% 

0.0% 

0.0% 

100.0% 

1075 

130 

21 

5 

3 

12395 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

1.3% 

2150 

390 

84 

25 

19 

13829 

15.5% 

2.8% 

0.6% 

0.2% 

0.1% 

100.0% 

98 

15 

0 

1 

0 

767 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.1% 

196 

45 

0 

5 

0 

899 

21.8% 

5.0% 

0.0% 

0.6% 

0.0% 

100.0% 

22 

6 

1 

0 

0 

418 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

44 

18 

4 

0 

0 

455 

9.7% 

4.0% 

0.9% 

0.0% 

0.0% 

100.0% 

269 

41 

11 

2 

1 

2832 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.3% 

538 

123 

44 

10 

7 

3230 

16.7% 

3.8% 

1 .4% 

0.3% 

0.2% 

100.0% 

9 

0 

0 

0 

0 

107 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

18 

0 

0 

0 

0 

116 

15.5% 

0.0% 

0.0% 

0.0% 

0.0% 

100.0% 

52 

5 

0 

0 

0 

636 

0.0% 

0.0% 

0.0% 

0.0% 

0.0% 

0.1% 

104 

15 

0 

0 

0 

698 

14.9% 

2.1% 

0.0% 

0.0% 

0.0% 

100.0% 

520 

99 

22 

3 

3 

4130 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.4% 

1040 

297 

88 

15 

19 

4942 

24 


%  of  acts 

Victim  major  sexual  violence 

70.5% 

21.0% 

6.0% 

1.8% 

0.3% 

0.4% 

100.0% 

Number  of  people 

3727 

635 

103 

26 

2 

2 

4495 

%  of  people 

0.4% 

0.1% 

0.0% 

0.0% 

0.0% 

0.0% 

0.5% 

Number  of  acts 

3727 

1270 

309 

104 

10 

14 

5434 

%  of  acts 

68.6% 

23.4% 

5.7% 

1.9% 

0.2% 

0.3% 

100.0% 

25 


Main  Table  3.  Comparison  of  risk  per  1,000  person-years  of  first  perpetration  and  victimization  of  major  physical,  minor  physical,  and  major  sexual  workplace  (non-familial)  violence  in 


the  2004-2009  HADS'  (n=975,051)  and  the  2011-2013  NSS  (n=21,832) 

Male 

Female 

Total 

HADS 

NSS 

HADS 

NSS 

HADS 

NSS 

Risk 

SE 

Risk 

SE 

Risk 

SE 

Risk 

SE 

Risk 

SE 

Risk 

SE 

Perpetration 

Major  physical  violence 

3.2* 

0.1 

1.9 

0.3 

1.3 

0.1 

0.7 

0.4 

2.9* 

0.1 

1.8 

0.2 

Major  sexual  violence 

3.1 

0.1 

3.1 

0.3 

0.5 

0.1 

0.5 

0.3 

2.7 

0.1 

2.7 

0.3 

Minor  physical  violence 

13.8* 

0.1 

6.7 

0.5 

9.2 

0.3 

7.2 

1.3 

13.1* 

0.1 

6.8 

0.4 

Victimization 

Major  physical  violence 

2.2* 

0.0 

0.8 

0.2 

2.3* 

0.1 

0.9 

0.5 

2.2* 

0.0 

0.8 

0.2 

Major  sexual  violence 

0.6 

0.0 

1.1 

0.3 

22.5 

0.4 

25.1 

2.6 

3.9 

0.1 

4.1 

0.4 

Minor  physical  violence 

6.7* 

0.1 

3.6 

0.4 

14.2* 

0.3 

10.2 

1.5 

7.8* 

0.1 

4.5 

0.4 

‘Significant  difference  between  HADS  and  NSS  rates  at  the  .05  level,  two-sided  test. 

Abbreviations:  HADS,  Historic  Administrative  Data  System;  NSS,  New  Soldier  Study;  SE,  standard  error. 

^  In  order  to  compare  risk  in  the  HADS  and  NSS,  the  HADS  sample  was  restricted  to  soldiers  (i.e.,  person-months)  with  less  than  33  months  in  service  (i.e.,  the  maximum  number  of  NSS  follow-up  months). 
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Main  Table  4.  Prevalence  of  self-reported  verbal,  physical,  and  sexual  violence  perpetration  and  victimization  in  the  AAS  (n=9,027)  and  PPDS  (n=8,552) 

_ AAS _ _ PPDS _ 

Male  Female  Total  Male  Female  Total 


% 

n 

% 

n 

% 

n 

% 

n 

% 

n 

% 

n 

1.  Perpetrator  of  verbal  violence 

A.  Yell,  insult,  swear,  or  threaten  someone  (past  30  days) 

16.1 

1319 

12.3 

136 

15.6 

1455 

12.9 

1086 

8.6 

50 

12.4 

1136 

B.  Heated  argument  with  someone  (past  30  days) 

8.3 

656 

8.5 

93 

8.3 

749 

6.0 

519 

5.7 

33 

6.0 

552 

C.  Loud  argument  in  public  (past  30  days) 

3.2 

230 

2.2 

26 

3.1 

256 

2.6 

218 

2.5 

14 

2.6 

232 

D.  Any  perpetrator  of  verbal  violence  (past  30  days) 

17.6 

1447 

13.8 

158 

17.2 

1605 

14.2 

1197 

9.6 

56 

13.8 

1253 

II.  Perpetrator  of  physical  violence 

A.  Physical  confrontation  during  an  argument  (past  30  days) 

3.1 

279 

1.9 

23 

3.0 

302 

3.2 

257 

2.8 

16 

3.1 

273 

B.  Sometimes  hit  people  so  hard  that  they  got  bruises  or  had  to  see  a  doctor 

2.6 

217 

0.9 

14 

2.4 

231 

6.6 

539 

2.8 

16 

6.2 

555 

C.  Any  perpetrator  of  physical  violence 

5.1 

437 

2.7 

35 

4.8 

472 

9.1 

740 

5.2 

30 

8.7 

770 

III.  Victim  of  physical  or  sexual  violence,  or  bullying 

A.  Experienced  physical  assault  (during  deployment) 

2.2 

94 

1.4 

8 

2.1 

102 

1.4 

129 

0.5 

3 

1.3 

132 

B.  Experienced  physical  assault  (past  12  months) 

0.9 

71 

2.7 

24 

1.2 

95 

1.0 

71 

2.5 

16 

1.1 

87 

C.  Victim  of  sexual  assault  (during  deployment) 

0.6 

25 

3.8 

21 

1.0 

46 

0.2 

12 

1.9 

12 

0.3 

24 

C.  Sexual  assault  (during  past  12  months) 

0.1 

12 

3.1 

21 

0.5 

33 

0.1 

6 

1.6 

9 

0.2 

15 

D.  Bullied  by  unit  members  (during  deployment) 

3.2 

163 

8.5 

49 

3.8 

212 

4.1 

292 

4.3 

26 

4.1 

318 

E.  Any  victim  of  physical  or  sexual  violence,  or  bullying 

5.0 

317 

11.4 

95 

5.7 

412 

6.2 

463 

9.6 

58 

6.5 

521 

Abbreviations:  AAS,  All  Army  Study;  PPDS,  Pre-post  Deployment  Study. 
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Main  Table  5.  Odds  ratios  of  self-reported  perpetration  and  victimization  in  the  AAS  (n=9,027)  and  PPDS  (n=8,552) 


AAS 

PPDS 

AAS 

PPDS 

AAS 

PPDS 

Perpetration 

Perpetration 

Perpetration 

Perpetration 

Victim  of  any 

Victim  of  any 

verbal  violence 

verbal  violence 

physical  violence 

physical  violence 

violence 

violence 

OR  (95%  Cl) 

OR  (95%  Cl) 

OR  (95%  Cl) 

OR  (95%  Cl) 

OR  (95%  Cl) 

OR  (95%  Cl) 

Gender 

Male 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

Female 

x"i 

0.7* 

(0.6-0.9) 

9.8* 

0.6* 

(0.5-0.8) 

9.7* 

0.5* 

(0.4-0.7) 

13.9* 

0.6* 

(0.4-0.8) 

7.5* 

2.5* 

(1. 9-3.2) 

43.8* 

1.6* 

(1. 1-2.3) 

7.6* 

Age  at  interview 

Less  than  20 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

21-24 

0.7 

(0.3-1. 5) 

0.8 

(0.7-1. 1) 

0.8 

(0.5-1. 5) 

0.8 

(0.6-1. 0) 

2.5* 

(1.2-5.1) 

2.3* 

(1. 5-3.6) 

25-29 

0.7 

(0.3-1 .4) 

0.7* 

(0.5-0.8) 

0.4* 

(0.2-0.9) 

0.7* 

(0. 5-0.9) 

2.6* 

(1. 3-5.2) 

3.5* 

(2. 1-6.0) 

30-44 

0.4* 

(0.2-0.9) 

0.5* 

(0.3-0.6) 

0.4* 

(0.2-0.7) 

0.5* 

(0. 3-0.7) 

3.8* 

(2. 0-7.3) 

3.5* 

(2. 1-5.9) 

45+ 

1^4 

0.2* 

(0.1 -0.5) 

42.5* 

0.1* 

(0.0-0.4) 

60.6* 

0.1* 

(0. 0-0.2) 

82.3* 

0.1* 

(0. 0-0.6) 

18.6* 

5.3* 

(2.4-11.7) 

27.0* 

14.1* 

(5.2-38.7) 

39.7* 

Race/Ethnicity 

Non-Hispanic  White 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

Non-Hispanic  Black 

0.9 

(0.7-1 .2) 

0.7* 

(0.6-1. 0) 

1.0 

(0.8-1. 3) 

1.3 

(1.0-1. 7) 

1.0 

(0.7-1 .2) 

1.3 

(0.6-3.2) 

Hispanic 

0.9 

(0.6-1. 5) 

0.8 

(0.7-1. 1) 

1.4 

(0.6-3.3) 

1.0 

(0.7-1. 4) 

0.9 

(0.5-1 .6) 

1.0 

(0. 7-1.3) 

Other 

0.7* 

(0.5-1. 0) 

8.6* 

0.9 

(0.7-1. 2) 

6.0 

1.0 

(0. 6-2.0) 

1.2 

1.2 

(0. 9-1.5) 

4.5 

1.1 

(0.7-1 .6) 

0.5 

1.2 

(0. 9-1.6) 

1.3 

Rank 

E1-E4 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

E5-E9 

0.9 

(0.7-1. 1) 

0.9 

(0.7-1. 1) 

0.5* 

(0.4-0.7) 

0.7* 

(0. 5-0.9) 

1.2 

(0.8-1. 7) 

1.5* 

(1.2-1.9) 

Officer 

X^2 

0.2* 

(0.1 -0.3) 

80.5* 

0.3* 

(0.2-0.5) 

28.2* 

0.2* 

(0.1 -0.4) 

47.9* 

0.2* 

(0. 1-0.7) 

15.5* 

1.4 

(1. 0-2.0) 

3.8 

1.5 

(0.6-3.5) 

15.9* 

Age  of  Enlistment 

17-18 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

19-20 

1.0 

(0.8-1 .2) 

0.8* 

(0.6-0.9) 

1.3 

(0. 9-2.0) 

1.1 

(0.9-1. 4) 

1.0 

(0.7-1 .4) 

1.2 

(0.9-1. 4) 

21-23 

0.6* 

(0.4-0.8) 

0.6* 

(0.5-0.7) 

0.8 

(0.5-1. 3) 

0.8 

(0.6-1. 1) 

0.8 

(0.5-1 .2) 

0.9 

(0.6-1. 1) 

24+ 

0.5* 

(0.3-0.8) 

16.0* 

0.4* 

(0.4-0.5) 

65.3* 

0.4* 

(0. 3-0.7) 

23.3* 

0.8 

(0.5-1. 3) 

5.9 

1.1 

(0.7-1 .6) 

2.3 

1.5 

(0. 8-3.0) 

5.3 

Months  in  active  service 
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Less  than  20 


21-45 

1.0 

(0.8-1 .2) 

0.8* 

(0.6-1. 0) 

1.0 

46-85 

1.1 

(0.9-1. 3) 

0.9 

(0.7-1 .2) 

0.8 

86+ 

0.8 

(0.7-1. 0) 

0.7* 

(0.6-1. 0) 

0.5* 

Months  in  active  or  non-active  sen/ice 

Less  than  20 

10.8* 

9.7* 

21-45 

0.9 

(0.7-1. 1) 

0.9 

(0.7-1. 1) 

1.0 

46-85 

0.9 

(0.7-1. 2) 

1.0 

(0.8-1. 3) 

0.8 

86+ 

0.7* 

(0.6-1. 0) 

0.7* 

(0.6-0.9) 

0.5* 

x'j 

Deployment 

Never  Deployed 

6.2* 

11.6* 

Currently  Deployed 

0.3* 

(0.2-0.5) 

- 

- 

0.3* 

Previously  Deployed 

1.1 

(0.9-1 .4) 

- 

- 

1.0 

X^2  40.9* 


(0.6-1. 5) 

0.8 

(0. 6-1.1) 

2.8* 

(1. 5-5.1) 

2.2* 

(1. 1-4.2) 

(0.5-1. 3) 

0.9 

(0. 6-1.2) 

3.8* 

(2. 0-7.5) 

4.3* 

(2. 3-8.2) 

(0. 3-0.9) 

11.0* 

0.6* 

(0.4-0.8) 

10.6* 

4.0* 

(2. 0-7.8) 

17.0* 

4.4* 

(2. 1-9.2) 

39.7* 

— 

— 

— 

— 

- 

— 

- 

(0.7-1. 4) 

0.8 

(0.6-1. 2) 

2.4* 

(1. 4-4.3) 

2.5* 

(1. 3-4.8) 

(0.5-1. 2) 

0.8 

(0.6-1. 0) 

3.1* 

(1. 6-6.0) 

5.5* 

(2.8-11.0) 

(0. 3-0.8) 

14.2* 

0.6* 

(0.4-0.9) 

7.8 

3.6* 

(1. 8-6.9) 

14.2* 

5.3* 

(2.7-10.6) 

40.6* 

— 

- 

— 

— 

— 

— 

— 

(0.2-0.4) 

- 

- 

0.6 

(0.4-1 .0) 

- 

- 

(0.6-1. 4) 

60.3* 

- 

- 

2.7* 

(1. 7-4.1) 

117.7* 

- 

- 

’Significant  at  the  .05  level,  two-sided  test. 

Abbreviations:  AAS,  All  Army  Study;  PPDS,  Pre-post  Deployment  Study,  OR  =  odds  ratio.  Cl  =  confidence  interval. 
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Main  Table  6.  Classification  accuracy,  concentration  of  risk,  and  standardized  risk  for  predicting  the  first  occurrence  of  major  physical  workplace  perpetration  and 
victimization  in  the  2004-2009  HADS  (n=975,051) 


_ Perpetration _  _ Victimization _ 

Male  Female  Total  Male  Female  Total 


Stepwise 

Logistic 

Elastic 

Net^ 

Stepwise 

Logistic 

Elastic 

Net^ 

Stepwise 

Logistic 

Elastic 

Net' 

Stepwise 

Logistic 

Elastic 

Net' 

Stepwise 

Logistic 

Elastic 

Net' 

Stepwise 

Logistic 

Elastic 

Net' 

AUC 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

0.8 

Concentration  of  risk^ 

Top  1% 

14.4 

14.3 

14.4 

11.6 

14.8 

14.7 

8.2 

8.1 

13.6 

13.7 

8.3 

8.2 

2-5% 

22.7 

22.8 

20.7 

23.0 

22.6 

22.5 

17.6 

17.7 

15.2 

14.9 

16.4 

16.3 

5-10% 

13.7 

13.5 

16.7 

13.4 

13.9 

14.0 

13.4 

13.5 

13.4 

13.7 

14.1 

14.7 

10-45% 

35.6 

35.8 

38.9 

40.4 

35.9 

35.9 

42.1 

41.9 

39.6 

39.4 

43.0 

42.4 

45-100% 

13.5 

13.5 

9.3 

11.6 

12.9 

12.8 

18.7 

18.7 

18.2 

18.2 

18.2 

18.5 

Standardized  rate  per  1,000  person-years^ 

Top  1% 

29.4 

29.2 

13.5 

10.5 

27.7 

27.6 

11.1 

11.0 

19.3 

19.2 

11.3 

11.2 

2-5% 

11.5 

11.6 

4.7 

5.0 

10.6 

10.6 

6.0 

6.0 

5.3 

5.2 

5.6 

5.6 

5-10% 

5.6 

5.5 

3.0 

1.9 

5.2 

5.3 

3.7 

3.7 

3.8 

3.8 

3.9 

4.0 

10-45% 

2.1 

2.1 

1.1 

1.2 

1.9 

1.9 

1.6 

1.6 

1.6 

1.6 

1.7 

1.7 

45-100% 

0.5 

0.5 

0.2 

0.2 

0.4 

0.4 

0.5 

0.5 

0.5 

0.5 

0.5 

0.5 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AUC,  area  under  the  receiver  operating  characteristic  curve. 

'  All  elastic  net  penalized  regression  models  performed  best  using  a  mixing  parameter  penalty  of  alpha=0.5. 

^  Final  models  were  used  to  generate  predicted  probabilities  for  each  person-month  in  the  sample.  These  predicted  probabilities  were  then  discretized  into  5  categories  (top  1%  of  predicted  probabilities; 
2-5%;  5-10%;  10-45%;  and  45-100%)  and  concentration  of  risk  (i.e.,  percent  of  all  first  occurrences  of  the  outcome  occurring  among  soldiers  with  a  particular  predicted  probability)  and  standardized 
outcome  rates  (i.e.,  per  1,000  person-years)  were  calculated. 
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Main  Table  7.  Classification  accuracy,  concentration  of  risk,  and  standardized  risk  for  predicting  the  first  occurrence  of  major  sexual 
workplace  perpetration  and  victimization  in  the  2004-2009  HADS  (n=975,051) 

_ Perpetration _  _ Victimization _ 

Male  Male  Female  Total 


Stepwise 

Logistic 

Elastic 

Net^ 

Stepwise 

Logistic 

Elastic 

Net^ 

Stepwise 

Logistic 

Elastic 

Net' 

Stepwise 

Logistic 

Elastic 

Net' 

AUC 

0.7 

0.7 

0.8 

0.8 

0.8 

0.8 

0.9 

0.9 

Concentration  of  risk^ 

Top  1% 

7.0 

7.0 

13.3 

10.7 

12.0 

11.9 

35.7 

34.8 

2-5% 

14.9 

14.9 

19.8 

20.9 

22.0 

22.0 

37.2 

36.9 

5-10% 

12.8 

13.1 

15.3 

6.9 

15.4 

15.8 

11.3 

11.6 

10-45% 

43.6 

42.6 

38.7 

45.7 

40.8 

40.5 

12.9 

13.5 

45-100% 

21.7 

22.5 

12.9 

15.7 

9.8 

9.8 

3.0 

3.2 

Standardized  rate  per  1,000  person-years^ 

Top  1% 

16.2 

16.1 

3.7 

3.1 

120.4 

119.3 

57.8 

54.6 

2-5% 

8.6 

8.6 

1.4 

1.1 

55.1 

55.2 

15.1 

15.1 

5-10% 

5.9 

5.9 

0.9 

1.2 

31.0 

31.7 

3.7 

3.8 

10-45% 

2.8 

2.8 

0.3 

0.4 

11.7 

11.6 

0.6 

0.6 

45-100% 

0.9 

0.9 

0.1 

0.1 

1.8 

1.8 

0.1 

0.1 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AUC,  area  under  the  receiver  operating  characteristic  curve. 

^  All  elastic  net  penalized  regression  models  performed  best  using  a  mixing  parameter  penalty  of  alpha=0.5. 

^  Final  models  were  used  to  generate  predicted  probabilities  for  each  person-month  in  the  sample.  These  predicted  probabilities  were  then  discretized 
into  5  categories  (top  1%  of  predicted  probabilities;  2-5%;  5-10%;  10-45%;  and  45-100%)  and  concentration  of  risk  (i.e.,  percent  of  all  first  occurrences  of 
the  outcome  occurring  among  soldiers  with  a  particular  predicted  probability)  and  standardized  outcome  rates  (i.e.,  per  1 ,000  person-years)  were 
calculated. 
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Main  Table  8:  Risk  per  1,000  person-years  of  major  physical,  minor  physical,  and  major  sexual  workplace  violence  perpetration  and  victimization  by 
gender  and  number  of  occurrences  in  NSS  2011-2013  sample  (n=21, 832) 


Male 

Female^ 

Totar 

Person- 
level  N 

Risk 

SE 

Person- 
level  N 

Risk 

SE 

Person- 
level  N 

Risk 

SE 

Perpetration  major  physical  violence 

53 

1.9 

0.3 

3 

0.7 

0.4 

56 

1.8 

0.2 

Exactly  1 

51 

1.8 

0.3 

3 

0.7 

0.4 

54 

1.6 

0.2 

Exactly  2 

2 

0.1 

0.1 

0 

0.0 

0.0 

2 

0.1 

0.1 

Perpetration  major  sexual  violence 

87 

3.1 

0.3 

3 

0.5 

0.3 

90 

2.7 

0.3 

Exactly  1 

86 

3.0 

0.3 

3 

0.5 

0.3 

89 

2.7 

0.3 

Exactly  2 

1 

0.0 

0.0 

0 

0.0 

0.0 

1 

0.0 

0.0 

Perpetration  minor  physical  violence 

194 

6.7 

0.5 

30 

7.2 

1.3 

224 

6.8 

0.4 

Exactly  1 

181 

6.2 

0.5 

30 

7.2 

1.3 

211 

6.3 

0.4 

Exactly  2 

13 

0.5 

0.1 

0 

0.0 

0.0 

13 

0.4 

0.1 

Victimization  major  physical  violence 

25 

0.8 

0.2 

4 

0.9 

0.5 

29 

0.8 

0.2 

Exactly  1 

25 

0.8 

0.2 

4 

0.9 

0.5 

29 

0.8 

0.2 

Victimization  major  sexual  violence 

34 

1.1 

0.3 

118 

25.1 

2.6 

152 

4.1 

0.4 

Exactly  1 

30 

0.8 

0.2 

104 

22.0 

2.4 

134 

3.5 

0.3 

Exactly  2 

4 

0.3 

0.2 

14 

3.1 

0.9 

18 

0.6 

0.2 

Victimization  minor  physical  violence 

109 

3.6 

0.4 

49 

10.2 

1.5 

158 

4.5 

0.4 

Exactly  1 

102 

3.5 

0.4 

44 

8.8 

1.4 

146 

4.1 

0.4 

Exactly  2 

6 

0.2 

0.1 

5 

1.3 

0.6 

11 

0.3 

0.1 

3  or  More 

1 

0.0 

0.0 

0 

0.0 

0.0 

1 

0.0 

0.0 

Abbreviations:  NSS,  New  Soldiers  Study,  SE,  standard  error. 

^  Males:  person-level  n=18,869,  person-month  n=366,511;  females:  person-level  n=2,963,  person-month  n=54,195;  total:  person-level  n=21,832,  person-month 
n=420,706. 
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Main  Table  9.  Classification  accuracy,  concentration  of  risk,  and  standardized  risk  for  predicting  the  first  occurrence  of  major  physical,  minor  physical,  and  major  sexual 
workplace  perpetration  and  victimization  in  the  2011-2013  NSS  (n=21,832) 


Perpetration _  _ Victimization 


Major  Physical  Violence 

Major  Sexual  Violence 

Minor  Physical  Violence 

Major  Sexual  Violence 

Minor  Physical  Violence 

(Males  only) 

(Males  only) 

(Total  sample) 

(Females  only) 

(Total  sample) 

Stepwise 

Elastic 

Stepwise 

Elastic 

Stepwise 

Elastic 

Stepwise 

Elastic 

Stepwise 

Elastic 

Logistic 

Net^ 

Logistic 

Net^ 

Logistic 

Net' 

Logistic 

Net' 

Logistic 

Net' 

AUC 

0.8 

0.8 

0.7 

0.7 

0.7 

0.8 

0.7 

0.7 

0.7 

0.7 

Concentration  of  risk^ 

Top  1% 

3.8 

5.7 

5.7 

6.9 

8.0 

8.9 

5.9 

3.4 

3.8 

3.8 

2-5% 

24.5 

24.5 

14.9 

19.5 

12.5 

11.2 

13.6 

16.9 

14.6 

14.6 

5-10% 

17.0 

15.1 

16.1 

13.8 

8.0 

15.6 

13.6 

9.3 

6.3 

6.3 

10-45% 

32.1 

34.0 

28.7 

34.5 

46.4 

42.0 

34.7 

40.7 

41.8 

41.8 

45-100% 

22.6 

20.8 

34.5 

25.3 

25.0 

22.3 

32.2 

29.7 

33.5 

33.5 

Standardized  rate  per  1,000  person-years^ 

Top  1% 

8.1 

9.7 

18.7 

20.9 

57.2 

62.1 

157.3 

112.6 

25.0 

25.0 

2-5% 

8.5 

8.3 

10.4 

13.8 

20.9 

18.0 

104.5 

135.1 

20.9 

20.8 

5-10% 

7.7 

7.0 

8.8 

7.5 

10.3 

20.3 

79.0 

54.8 

6.6 

6.6 

10-45% 

1.6 

1.6 

2.4 

2.7 

8.3 

7.6 

29.1 

31.9 

5.4 

5.4 

45-100% 

0.7 

0.7 

1.7 

1.3 

2.9 

2.5 

14.4 

13.6 

2.6 

2.6 
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Appendix  Table  1 .  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final  model  predicting  first  perpetration 
of  major  physical  workplace  violence,  among  males  in  the  2004-2009  HADS  (using  the  best  25  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Core  Variables 

Year  -  2009 

0.8* 

0.8 

Demographics 

Age  -  23+ 

0.6* 

0.6 

Race  -  Non-Hispanic  Black 

1.9* 

1.9 

Race  -  Non-Hispanic  White 

0.8* 

0.8 

Education  -  Graduated  high  school 

0.6* 

0.6 

Education  -  Graduated  college 

0.4* 

0.5 

Army  Career 

MOS  Infantry 

1.3* 

1.3 

Currently  deployed 

0.3* 

0.3 

Command  -  TRADOC 

0.5* 

0.6 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

1.3* 

1.3 

Rank  -  E5+  (including  officer) 

0.8* 

0.8 

Months  in  service  - 121  + 

0.5* 

0.5 

Last  demotion  occurred  within  12  months 

1.5* 

1.5 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration) 

1.4* 

1.4 

Perpetrator  of  minor  violence  in  past  24  months  (Count  of  days) 

1.6* 

1.6 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

2.0* 

2.0 

Medical  History 

1+  suicide  attempts  between  (not  including)  the  day  of  the  crime  and  12  months  before  (Yes/No) 

3.1* 

3.1 

Conduct/ODD  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

2.0* 

2.0 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime 

0.6* 

0.6 

Count  of  days  with  outpatient  visits  in  the  past  3  months 

1.3* 

1.3 

Any  stress  disorder  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.3* 

1.3 

Sedative-Hypnotic  prescribed  in  the  past  12  months 

1.5* 

1.4 

Parent  Unit 

Median  amount  of  time  in  the  Army  of  all  NCOs  (E5-9)  in  soldier’s  unit(s) 

1.0 

1.0 

Median  amount  of  time  in  the  Army  of  all  officers  (WO  and  CO)  in  soldier’s  unit(s) 

1.0* 

1.0 

Median  amount  of  time  ever  deployed  of  all  NCOs  (E5-9)  in  soldier’s  unit(s) 

1.0* 

1.0 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  MOS,  military  occupational  specialty;  ODD,  oppositional  defiant  disorder;  NCO,  non¬ 
commissioned  officer;  CO,  commissioned  officer;  WO,  warrant  officer. 
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Appendix  Table  2.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final  model  predicting  first 
perpetration  of  major  physical  workplace  violence,  among  females  in  the  2004-2009  HADS  (using  the  best  20  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Demographics 

Race  -  Non-Hispanic  Black 

3.7* 

1.4 

Age  of  Enlistment  -  22+ 

0.6* 

0.8 

Army  Career 

Currently  deployed 

0.2* 

0.3 

Command  -  FORSCOM 

2.3* 

1.5 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

2.0* 

1.5 

Rank  -  E5+  (including  officer) 

0.4* 

0.9 

Rank  -  E7+  (including  officer) 

0.4* 

0.6 

Last  demotion  occurred  within  12  months 

2.1* 

1.6 

Crime  and  Drugs 

Perpetrator  of  verbal  violence  in  past  12  months  (Count  of  days) 

59.5* 

2.3 

Victim  of  major  violence  in  past  12  months  (Yes/No) 

2.9* 

1.2 

Victim  of  any  crime  in  past  12  months  (Yes/No) 

1.9* 

1.7 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

2.1* 

1.4 

Medical  History 

Alcohol  abuse/dependence  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.4* 

1.2 

Prior  mental  disorders  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

0.3* 

0.7 

Adjustment  disorder  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

2.4* 

1.5 

Drug-induced  mental  illness  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

8.8* 

3.4 

Non-affective  psychosis  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

14.0* 

3.0 

Occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.6* 

1.7 

Stressors/adversities  -  count  of  days  in  the  hospital  in  the  past  12  months 

1.3 

0.8 

Depressive  psychosis  -  occurrence  of  days  in  the  hospital  in  the  past  12  months  (Yes/No) 

13.1* 

3.0 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System. 
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Appendix  Table  3.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final  model  predicting  first  perpetration 
of  major  physical  workplace  violence,  in  the  total  2004-2009  HADS  (using  the  best  25  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Demographics 

Age  -  23+ 

0.6* 

0.6 

Gender  -  Male 

2.5* 

2.4 

Race  -  Non-Hispanic  Black 

2.0* 

2.0 

Race  -  Non-Hispanic  White 

0.7* 

0.8 

Education  -  Graduated  high  school 

0.6* 

0.6 

Education  -  Graduated  college 

0.5* 

0.5 

Army  Career 

MOS  Infantry 

1.3* 

1.3 

Currently  Deployed 

0.3* 

0.3 

Command  -  TRADOC 

0.5* 

0.5 

Command  -  AMC/Other/Unknown 

0.5* 

0.6 

Rank  -  E5+  (including  officer) 

0.8* 

0.8 

Months  in  service  - 121  + 

0.5* 

0.6 

Last  demotion  occurred  within  12  months 

1.5* 

1.5 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration) 

1.4* 

1.4 

Perpetrator  of  minor  violence  in  past  24  months  (Count  of  days) 

1.6* 

1.6 

Victim  of  minor  violence  in  past  24  months  (Count  of  days) 

1.7* 

1.7 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

2.0* 

1.9 

Medical  History 

1+  suicide  attempts  between  (not  including)  the  day  of  the  crime  and  12  months  before  (Yes/No) 

2.6* 

2.6 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime 

0.6* 

0.6 

Count  of  days  with  outpatient  visits  in  the  past  3  months 

1.3* 

1.3 

Adjustment  disorder  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.5* 

1.5 

Occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.3* 

1.3 

Flag  of  four  or  more  narcotic  or  psychotropic  medications  in  3-month  period  in  the  past  3  months 

1.4* 

1.4 

Parent  Unit 

Median  amount  of  time  in  the  Army  of  all  NCOs  {E5-9)  in  soldier’s  unit(s) 

1.0 

1.0 

Median  amount  of  time  in  the  Army  of  all  officers  (WO  and  CO)  in  soldier’s  unit(s) 

1.0* 

1.0 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  NCO,  non-commissioned  officer;  CO,  commissioned  officer;  WO,  warrant  officer. 
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Appendix  Table  4.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the  final  model  predicting  first 
victimization  of  major  physical  workplace  violence,  among  males  in  the  2004-2009  HADS  (using  the  best  20  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio  Odds  Ratio 


Core  Variables 


Year  -  2004 

1.5* 

1.5 

Demographics 

Age  -  23+ 

0.7* 

0.7 

Currently  Married 

0.7* 

0.7 

Education  -  Graduated  high  school 

0.7* 

0.7 

Army  Career 

Currently  Deployed 

0.6* 

0.6 

Command  -  FORSCOM 

1.6* 

1.6 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

2.9* 

2.8 

Rank  -  E5+  (including  officer) 

0.7* 

0.7 

Rank  -  E7+  (including  officer) 

0.4* 

0.4 

AFQT  score  0-49 

1.3* 

1.3 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration) 

1.2* 

1.2 

Victim  of  any  crime  in  past  12  months  (Yes/No) 

1.6* 

1.6 

Perpetrator  of  minor  violence  in  past  24  months  (Yes/No) 

1.6* 

1.6 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

1.6* 

1.6 

Medical  History 

Injury  and  poisoning  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.2 

Any  substance  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.6* 

1.6 

Analgesic  -  Narcotic  short-acting  not  Schedule  II  in  the  past  12  months  (times  prescribed) 

1.2* 

1.2 

Antianxiety  Agent  -  Antihistamine  Type  in  the  past  12  months  (Prescribed  yes/No) 

1.9* 

1.9 

Parent  Unit 

Median  amount  of  time  ever  deployed  of  all  NCOs  (E5-9)  in  soldier’s  unit(s) 

1.0* 

1.0 

On  average,  the  percentage  of  combat  deaths  over  the  past  year  for  all  duty  units 

1.6* 

1.6 

’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AFQT,  Armed  forces  qualification  test;  NCO,  non-commissioned  officer. 
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Appendix  Table  5.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  finai  model  predicting  first 
victimization  of  major  physical  workplace  violence,  among  females  in  the  2004-2009  HADS  (using  the  best  20  predictors) 


Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio _ Odds  Ratio 

Army  Career 

Currently  Deployed  0.6*  0.6 

Rank  -  E5+  (including  officer)  0.6*  0.6 

Rank  -  E7+  (including  officer)  0.4*  0.4 

AFQT  score  0-49  1.4*  1.4 

Last  demotion  occurred  within  12  months  1.8*  1.8 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration)  1.6*  1.6 

Victim  of  major  sexual  violence  in  past  12  months  (Count  of  days)  2.2*  2.2 

Perpetrator  of  non-violent  drunkenness/vagrancy/disorderly  crime  in  past  24  months  (Count  of  days)  3.3*  3.2 

Victim  of  any  crime  in  past  12  months  (Yes/No)  2.5*  2.6 

Number  of  months  since  you  had  a  positive  drug  test  (Within  3  months)  3.0*  2.9 

Medical  History 

Marital  problems  -  count  of  days  with  outpatient  visits  in  the  past  1 2  months  1.3*  1.3 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime  0.3*  0.4 

Count  of  days  with  outpatient  visits  in  the  past  3  months  1.4*  1.3 

Drug  dependence  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No)  2.8*  2.8 

Anxiety  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No)  1.7*  1.7 

Cccurrence  of  days  in  the  hospital  from  first  of  the  month  to  day  before  crime  (Yes/No)  6.8*  5.6 

Analgesic  -  Non-Narcotic  -  Analgesics  prescribed  in  the  past  1 2  months  1 .2*  1 .2 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months  1.3*  1.3 

Median  amount  of  time  ever  deployed  of  all  NCCs  (E5-9)  in  soldier’s  unit(s)  1.0*  1.0 

Cn  average,  the  percentage  of  combat  deaths  over  the  past  year  for  all  duty  units  4.0*  3.7 


*Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AFQT,  Armed  forces  qualification  test;  NCO,  non-commissioned  officer. 
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Appendix  Table  6.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final  model  predicting  first  victimization  of 
major  physicai  vioience,  in  the  total  2004-2009  HADS  (using  the  best  20  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Cdds  Ratio 

Cdds  Ratio 

Demographics 

Age  -  23+  yrs 

0.8* 

0.8 

Currently  Married 

0.8* 

0.8 

Education  -  Graduated  high  school  or  higher 

0.7* 

0.7 

Army  Career 

Currently  Deployed 

0.6* 

0.6 

Command  -  FORSCOM 

1.5* 

1.4 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

2.4* 

2.3 

Rank  -  E5+  (including  officer) 

0.7* 

0.7 

Rank  -  E7+  (including  officer) 

0.4* 

0.4 

AFOT  score  0-49 

1.3* 

1.3 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration) 

1.2* 

1.2 

Victim  of  non-violent  crime  in  past  12  months  (Yes/No) 

0.5* 

0.5 

Victim  of  any  crime  in  past  12  months  (Yes/No) 

3.0* 

2.9 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

1.6* 

1.6 

Medical  History 

Injury  and  poisoning  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.2 

Any  substance  disorder  -  occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.6* 

1.6 

Analgesic  -  Narcotic  short-acting  not  Schedule  II  prescribed  in  the  past  12  months 

1.2* 

1.2 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months 

1.1* 

1.1 

Median  amount  of  time  in  the  Army  of  all  officers  (WC  and  CC)  in  soldier’s  unit(s) 

1.0* 

1.0 

Median  number  of  months  all  lower  enlisted  soldiers  (E1-E4)  have  been  on  duty  in  unit 

1.0* 

1.0 

Cn  average,  the  percentage  of  combat  deaths  over  the  past  year  for  all  duty  units 

1.6* 

1.6 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AFQT,  Armed  forces  qualification  test;  WO,  warrant  officer;  CO,  commissioned  officer. 
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Appendix  Table  7.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the  final  model  predicting  first 
perpetration  of  sexual  workplace  violence,  among  males  in  the  2004-2009  HADS  (using  the  best  20  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Core  Variables 

Year  -  2004 

0.7* 

0.8 

Demographics 

Age  -  23+ 

0.8* 

0.8 

Race  -  Non-Hispanic  Black 

1.8* 

1.8 

Race  -  Hispanic 

1.5* 

1.4 

Education  -  Graduated  high  school  or  higher 

0.8* 

0.8 

Army  Career 

Currently  Deployed 

0.4* 

0.4 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

1.3* 

1.3 

Command  -  Special  Ops 

0.5* 

0.6 

Rank  -  E7+  (including  officer) 

0.5* 

0.5 

Rank  -  Officer 

0.5* 

0.6 

AFQT  score  0-49 

1.2* 

1.2 

Last  demotion  occurred  within  12  months 

1.5* 

1.5 

Crime  and  Drugs 

Perpetrator  of  any  crime  in  past  12  months  (Count  of  each  type  of  perpetration) 

1.2* 

1.2 

Perpetrator  of  non-workplace  (familial)  sexual  violence  in  past  24  months  (Count  of  days) 

11.8* 

11.5 

Perpetrator  of  any  crime  in  past  24  months  (Yes/No) 

2.0* 

2.0 

Number  of  months  since  you  had  a  positive  test  (Within  12  months) 

0.6* 

0.6 

Medical  History 

Count  of  days  with  outpatient  visits  in  the  past  3  months 

1.3* 

1.3 

Occurrence  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime  (Yes/No) 

0.4* 

0.4 

Occurrence  of  days  with  outpatient  visits  in  the  past  12  months  (Yes/No) 

1.3* 

1.3 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months 

1.2* 

1.2 

’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  AFQT,  Armed  forces  qualification  test. 
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Appendix  Table  8.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the  final  model  predicting  first 
victimization  of  sexual  workplace  violence,  among  males  in  the  2004-2009  HADS  (using  the  best  20  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio  Odds  Ratio 


Core  Variables 

Year  -  2007 

1.5* 

1.0 

Year  -  2004 

0.4* 

1.0 

Demographics 

Age  -  23+  yrs 

0.6* 

0.7 

Army  Career 

Duty  MOS  -  Basic  Training 

1.6* 

1.0 

Currently  deployed 

0.7* 

1.0 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

1.8* 

1.0 

Rank  -  E5+  (including  officer) 

0.3* 

0.6 

Months  in  service  -  25+ 

0.6* 

0.6 

Crime  and  Drugs 

Perpetrator  of  sexual  violence  in  past  12  months  (Count  of  days) 

7.8* 

5.4 

Victim  of  any  crime  in  past  12  months  (Count  of  each  type  of  victimization) 

1.8* 

1.0 

Number  of  months  since  you  had  a  positive  test  (Within  12  months) 

3.4* 

1.9 

Medical  History 

Adjustment  disorder  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.4* 

1.1 

Injury  and  poisoning  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.3* 

1.0 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime 

0.4* 

1.0 

Count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.1 

Occurrence  of  days  with  outpatient  visits  in  the  past  3  months 

2.1* 

1.2 

Occurrence  of  days  with  hospitalizations  from  first  of  the  month  to  day  before  crime 

5.6* 

1.0 

Antidepressants  prescribed  (SSRI)  in  the  past  12  months 

2.0* 

1.2 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months 

1.3* 

1.1 

Median  number  of  months  all  officers  (CO  and  WO)  have  been  on  duty  in  this  unit 

1.0* 

1.0 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System;  MOS,  military  occupational  specialty;  CO,  commissioned  officer;  WO,  warrant  officer. 
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Appendix  Table  9.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the  final  model 
predicting  first  victimization  of  sexuai  workpiace  violence,  among  females  in  the  2004-2009  HADS  (using  the  best  20 
predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Core  Variables 

Year  -  2004 

0.5* 

0.6 

Demographics 

Age  -  20+ 

0.8* 

0.8 

Age  -  23+ 

0.6* 

0.6 

Currently  Married 

0.7* 

0.8 

Race  -  Non-Hispanic  White 

1.9* 

1.9 

Education  -  Graduated  high  school  or  higher 

0.7* 

0.7 

Army  Career 

Command  -  FORSCOM 

1.4* 

1.4 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

2.0* 

2.0 

Rank  -  E5+  (including  officer) 

0.5* 

0.5 

Rank  -  E7+  (including  officer) 

0.3* 

0.4 

Months  in  service  - 13+ 

0.6* 

0.6 

Months  in  service  -  25+ 

0.6* 

0.6 

Crime  and  Drugs 

Perpetrator  of  sexual  violence  in  past  12  months  (Yes/No) 

7.9* 

7.6 

Victim  of  non-workplace  (familial)  sexual  violence  in  past  12  months  (Yes/No) 

8.8* 

8.7 

Victim  of  any  crime  in  past  24  months  (Yes/No) 

1.6* 

1.6 

Medical  History 

Injury  and  poisoning  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.2 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime 

0.6* 

0.7 

Count  of  days  with  outpatient  visits  in  the  past  12  months 

1.3* 

1.3 

PTSD  -  occurrence  of  days  in  the  hospital  in  the  past  12  months  (Yes/No) 

4.4* 

4.2 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months  (Yes/No) 

1.2* 

1.2 

’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System. 
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Appendix  Table  10.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final  model 
predicting  first  victimization  of  sexual  workplace  violence,  in  the  total  2004-2009  HADS  (using  the  best  20  predictors) 


Stepwise 

Elastic  Net 

Logistic 

(alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Core  Variables 

Year-  2004 

0.5* 

0.9 

Demographics 

Age  -  23+ 

0.6* 

0.6 

Currently  Married 

0.8* 

1.0 

Gender  -  Male 

0.0* 

0.1 

Race  -  Non-Hispanic  White 

1.8* 

1.2 

Education  -  Graduated  high  school  or  higher 

0.7* 

0.8 

Army  Career 

Command  -  N/S  America,  Europe/Central/Africa,  Pacific 

1.7* 

1.1 

Rank  -  E5+  (including  officer) 

0.5* 

0.6 

Rank  -  E7+  (including  officer) 

0.3* 

0.9 

Months  in  service  - 13+ 

0.7* 

0.7 

Months  in  service  -  25+ 

0.6* 

0.6 

Crime  and  Drugs 

Perpetrator  of  sexual  violence  in  past  12  months  (Count  of  days) 

5.2* 

1.9 

Victim  of  non-workplace  (familial)  sexual  violence  in  past  12  months  (Yes/No) 

8.5* 

13.6 

Victim  of  any  crime  in  past  24  months  (Yes/No) 

1.6* 

1.1 

Number  of  months  since  you  had  a  positive  test  (Within  3  months) 

2.6* 

1.1 

Medical  History 

Injury  and  poisoning  -  count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.2 

Count  of  days  with  outpatient  visits  from  first  of  the  month  to  day  before  crime 

0.5* 

1.0 

Count  of  days  with  outpatient  visits  in  the  past  3  months 

1.3* 

1.0 

Count  of  days  with  outpatient  visits  in  the  past  12  months 

1.2* 

1.2 

Parent  Unit 

Number  of  different  duty  units  over  past  12  months 

1.3* 

1.2 

’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  HADS,  Historical  Administrative  Data  System. 


Appendix  Table  11.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the 
final  model  predicting  first  perpetration  of  major  physical  workplace  violence,  among  males  in  the  2011- 
2013  NSS  (using  the  best  5  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio  Odds  Ratio 


Demographics 


Race/Ethnicity  -  Non-Hispanic  Black 

5.0* 

4.3 

Army  career 

Months  in  the  Army  -  Less  than  12 

0.3* 

0.3 

Medical  history 

Any  medical  failure  at  accession 

3.3* 

2.7 

Personality  traits 

Introversion 

0.8* 

0.8 

Stress 

Experienced  severe  or  very  severe  stress  in  past  12  months 

2.7* 

2.5 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  NSS,  New  Soldiers  Study. 
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Appendix  Table  12.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios 


for  the  final  model  predicting  first  perpetration  of  sexual  workplace  violence,  among  males  in  the 
2011-2013  NSS  (using  the  best  8  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Demographics 

Race/Ethnicity  -  Non-Hispanic  Black 

3.8* 

3.2 

Race/Ethnicity  -  Hispanic 

2.8* 

2.4 

Army  Career 

Months  in  the  Army  -  3  or  less 

0.3* 

0.4 

Months  in  the  Army  -  8  or  less 

1.7* 

1.3 

Personality 

Introversion 

0.8* 

0.9 

Mental  disorders 

Lifetime  ODD  (Yes/No) 

2.0* 

1.5 

Childhood  adversity 

Physically  abused  at  home 

1.5* 

1.6 

Social  networks 

Count  of  sexual  partners  in  the  past  12  months 

1.1* 

1.1 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  NSS,  New  Soldiers  Study. 
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Appendix  Table  13.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the  final  model  predicting  first 
perpetration  of  minor  physical  workplace  violence,  in  the  total  2011-2013  NSS  (using  the  best  13  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio 

Odds  Ratio 

Demographics 

Race/Ethnicity  -  Non-Hispanic  Black 

2.0* 

2.0 

Education  -  Some  College/College  Graduate 

0.3* 

0.4 

Currently/Previously  Married 

1.7* 

1.6 

Army  career 

Months  in  the  Army  -  4  or  less 

0.1* 

0.1 

AFQT  Score  (percentile)  -  43+ 

0.7* 

0.7 

Personality 

Introversion 

0.9* 

1.0 

Mental  disorders 

Number  of  years  with  lifetime  insomnia 

1.1* 

1.1 

GAD  symptoms  - 1  + 

0.5* 

0.5 

Number  of  years  with  problem  behaviors  - 1  + 

1.6* 

1.8 

Number  of  years  with  anger  attacks  -  4+ 

1.7* 

1.7 

Lifetime  Substance  Disorders  (Yes/No) 

1.7* 

1.9 

Stress 

Experienced  severe  or  very  severe  stress  in  past  12  months  regarding  health  of  loved  one 

1.5* 

1.4 

Childhood  adversity 

Sent  to  juvenile  detention  center 

4.1* 

4.0 

’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  NSS,  New  Soldiers  Study. 
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Appendix  Table  14.  Stepwise  logistic  regression  and  elastic  net  penalized  regression  odds  ratios  for  the 
final  model  predicting  first  victimization  of  sexuai  workplace  violence,  among  females  in  the  2011-2013  NSS 


(using  the  best  5  predictors) 

Stepwise  Elastic  Net 

Logistic  (alpha=0.5) 

Odds  Ratio _ Odds  Ratio 

Demographics 

Race/Ethnicity  -  Non-Hispanic  Black  0.4*  0.3 

Army  career 

Months  in  the  Army  -  4  or  less  0.4*  0.5 

Personality 

Impulsivity-sensation  seeking  1.2*  1.1 

Pre-enlistment  trauma 

Sexual  Assault  - 1  or  more  2.0*  2.7 

Childhood  adversity 

Count  of  types  of  childhood  adversities  - 10+  2.3*  2.8 


’Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic  net  penalized 
regression  models  do  not  provide  significance  tests). 

Abbreviations:  NSS,  New  Soldiers  Study. 
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Appendix  Table  15.  Stepwise  logistic  regression  and  elastic  net  penaiized  regression  odds  ratios  for  the  final 
model  predicting  first  victimization  of  minor  physical  workplace  violence  in  the  total  2011-2013  NSS  (using  the 
best  10  predictors) 


Core  variables 

Stepwise 

Logistic 

Odds  Ratio 

Elastic  Net 
(alpha=0.5) 

Odds  Ratio 

Quarter  of  NSS  interview  -  8 

Demographics 

2.2* 

2.3 

Gender  -  Male 

0.4* 

0.3 

Born  in  the  US 

Army  career 

3.8* 

2.7 

Months  in  the  Army  -  6  or  less 

0.2* 

0.3 

Basic  Combat  Training  Site  -  Fort  Leonard  Wood 

Personality 

1.5* 

1.6 

Second  order  personality  scale  -  Thoughtfulness 

Mental  Disorders 

0.7* 

0.8 

Number  of  years  with  lifetime  insomnia  -  2+ 

Stress 

1.8* 

1.3 

Experienced  severe  or  very  severe  stress  in  past  12  months 

Trauma 

1.9* 

1.9 

Count  of  types  of  traumas  -  5+ 

Childhood  adversity 

1.5* 

1.3 

Child  Adversity  -  In  foster  home 

2.1* 

1.6 

‘Significant  at  the  .05  level,  two-sided  test.  Significance  could  only  be  calculated  for  the  stepwise  logistic  regression  models  (elastic 
net  penalized  regression  models  do  not  provide  significance  tests). 

Abbreviations:  NSS,  New  Soldiers  Study. 
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Appendix  C:  Predicting  Suicides  After  Psychiatric  Hospitalization  in  US  Army  Soldiers 


Original  Investigation 

Predicting  Suicides  After  Psychiatric  Hospitalization  in  US 
Army  Soldiers 

The  Army  Study  to  Assess  Risk  and  Resilience  in 
Servicemembers  (Army  STARRS) 

Ronald  C.  Kessler,  PhD;  Christopher  H.  Warner,  MD;  Christopher  Ivany,  MD;  Marla  V.  Petukhova,  PhD;  Sherri  Rose,  PhD;  Evelyn  J.  Bromet,  PhD; 

Millard  Brown  III,  MD,  MB;  TIanxI  Cal,  ScD;  Lisa  J.  Coipe,  PhD,  MPH;  Kenneth  L.  Cox,  MD,  MPH;  Carol  S.  Fullerton,  PhD;  Stephen  E.  Gilman,  ScD; 

Michael  J.  Gruber,  MS;  Steven  G.  Heeringa,  PhD;  Lisa  Lewandowski-Romps,  PhD;  Junlong  LI,  PhD;  Amy  M.  Mllllkan-Bell,  MD,  MPH;  James  A.  Nalfeh,  PhD; 
Matthew  K.  Nock,  PhD;  Anthony  J.  Roselllnl,  PhD;  Nancy  A.  Sampson,  BA;  Michael  Schoenbaum,  PhD;  Murray  B.  Stein,  MD,  MPH;  Simon  Wessely,  PhD; 
Alan  M.  Zaslavsky,  PhD;  Robert  J.  Ursano,  MD;  forthe  Army  STARRS  Collaborators 


IMPORTANCE  The  US  Army  experienced  a  sharp  increase  in  soldier  suicides  beginning  in 
2004.  Administrative  data  reveal  that  among  those  at  highest  risk  are  soldiers  in  the  12 
months  after  inpatient  treatment  of  a  psychiatric  disorder. 

OBJECTIVE  To  develop  an  actuarial  risk  algorithm  predicting  suicide  in  the  12  months  after  US 
Army  soldier  inpatient  treatment  of  a  psychiatric  disorder  to  target  expanded 
posthospitalization  care. 

DESIGN.  SETTING,  AND  PARTICIPANTS  There  were  53  769  hospitalizations  of  active  duty 
soldiers  from  January  1, 2004,  through  December  31,  2009,  with  International  Classification 
of  Diseases,  Ninth  Revision,  Clinical  Modification  psychiatric  admission  diagnoses. 
Administrative  data  available  before  hospital  discharge  abstracted  from  a  wide  range  of  data 
systems  (sociodemographic,  US  Army  career,  criminal  justice,  and  medical  or  pharmacy)  were 
used  to  predict  suicides  in  the  subsequent  12  months  using  machine  learning  methods 
(regression  trees  and  penalized  regressions)  designed  to  evaluate  cross-validated  linear, 
nonlinear,  and  interactive  predictive  associations. 


MAIN  OUTCOMES  AND  MEASURES  Suicides  of  soldiers  hospitalized  with  psychiatric  disorders 
in  the  12  months  after  hospital  discharge. 


RESULTS  Sixty-eight  soldiers  died  by  suicide  within  12  months  of  hospital  discharge  (12.0%  of 
all  US  Army  suicides),  equivalent  to  263.9  suicides  per  100  000  person-years  compared  with 
18.5  suicides  per  100  000  person-years  in  the  total  US  Army.  The  strongest  predictors 
included  sociodemographics  (male  sex  [odds  ratio  (OR),  7.9;  95%  Cl,  1.9-32.6]  and  late  age  of 
enlistment  [OR,  1.9;  95%  Cl,  1.0-3.5]),  criminal  offenses  (verbal  violence  [OR,  2.2;  95%  Cl, 
1.2-4.0]  and  weapons  possession  [OR,  5.6;  95%  Cl,  1.7-18.3]),  prior  suicidality  [OR,  2.9;  95% 
Cl,  1.7-4.9],  aspects  of  prior  psychiatric  inpatient  and  outpatient  treatment  (eg,  number  of 
antidepressant  prescriptions  filled  in  the  past  12  months  [OR,  1.3;  95%  Cl,  1.1-1.7]),  and 
disorders  diagnosed  during  the  focal  hospitalizations  (eg,  nonaffective  psychosis  [OR,  2.9; 
95%  Cl,  1.2-7.0]).  A  total  of  52.9%  of  posthospitalization  suicides  occurred  after  the  5%  of 
hospitalizations  with  highest  predicted  suicide  risk  (3824.1  suicides  per  100  000 
person-years).  These  highest-risk  hospitalizations  also  accounted  for  significantly  elevated 
proportions  of  several  other  adverse  posthospitalization  outcomes  (unintentional  injury 
deaths,  suicide  attempts,  and  subsequent  hospitalizations). 

CONCLUSIONS  AND  RELEVANCE  The  high  concentration  of  risk  of  suicide  and  other  adverse 
outcomes  mightjustify  targeting  expanded  posthospitalization  interventions  to  soldiers 
classified  as  having  highest  posthospitalization  suicide  risk,  although  final  determination 
requires  careful  consideration  of  intervention  costs,  comparative  effectiveness,  and  possible 
adverse  effects. 

JAMA  Psychiatry.  doi:10.1001/jamapsychiatry.2014.1754 
Published  online  November  12, 2014. 
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Original  Investigation 


Predicting  US  Army  Suicides 


The  US  Army  suicide  rate,  although  historically  below  the 
civilian  rate,  has  increased  since  2004'  to  exceed  the  ci¬ 
vilian  rate.^  Despite  numerous  efforts  to  address  this 
problem,  including  universal  interventions  (eg,  Ask/Care/ 
Escort  prevention  education  and  depression,  posttraumatic 
stress  disorder,  and  suicide  screening  in  all  primary  care  en¬ 
counters)  and  high-risk  interventions  (eg,  postdeployment 
screening),^  the  US  Army  suicide  rate  has  continued  to  in¬ 
crease.  One  potentially  important  group  for  targeted  interven¬ 
tions  is  soldiers  recently  discharged  from  inpatient  psychiat¬ 
ric  treatment.  Such  patients  have  long  been  known  to  have  a 
high  risk  of  suicide. US  military  administrative  data  docu¬ 
ment  an  8-fold  elevated  suicide  risk  in  the  3  months  after  psy¬ 
chiatric  hospitalization  and  a  5-fold  elevated  risk  for  the  re¬ 
mainder  of  the  12  months  after  hospitalization.^  A  report®  on 
the  similar  patterns  among  civilians  called  for  expansion  of 
posthospitalization  suicide  preventive  interventions,  noting 
that  such  interventions  in  the  United  Kingdom  (eg,  required 
outpatient  visits  within  1  week  of  hospital  discharge,  asser¬ 
tive  outreach  for  missed  outpatient  appointments,  24-hour 
community  crisis  teams,  and  intensive  community  support  for 
patients  difficult  to  engage  in  traditional  services)  were  asso¬ 
ciated  with  significant  before-after  reductions  in  posthospi¬ 
talization  suicides.^ 

Suicide  is  a  rare  outcome  even  among  recently  dis¬ 
charged  psychiatric  inpatients®;  therefore,  the  benefits  of  pro¬ 
viding  intensive  posthospitalization  suicide  prevention  inter¬ 
ventions  to  all  recently  discharged  inpatients  are  low.  A  more 
rational  allocation  of  treatment  resources  would  be  to  com¬ 
bine  relatively  inexpensive  universal  interventions^  with  more 
intensively  targeted  high-risk  interventions."^  However,  this 
tiered  approach  would  require  developing  a  reliable  risk  strati¬ 
fication  scheme.  The  US  Department  of  Veterans  Affairs  (VA) 
and  the  US  Department  of  Defense  (DoD)  called  for  this  kind 
of  differentiation  in  their  Clinical  Practice  Guideline  (CPG)  en¬ 
titled  Assessment  and  Management  of  Patients  at  Risk  for 
Suicidef°  However,  the  CPG  provided  little  concrete  guid¬ 
ance  on  how  these  assessments  should  be  implemented.  Re¬ 
search  has  consistently  revealed  that  health  care  profession¬ 
als  are  not  accurate  in  making  such  assessments. 

One  potentially  promising  approach  to  assessing  posthos¬ 
pitalization  suicide  risk  would  be  to  use  administrative  data 
available  during  hospitalization  to  generate  an  actuarial  post¬ 
hospitalization  suicide  risk  algorithm.  Previous  research  has 
revealed  that  actuarial  suicide  prediction  is  much  more  accu¬ 
rate  than  prediction  based  on  clinical  judgment. An  in¬ 
creasing  number  of  computerized  risk  algorithms  are  being 
used  as  clinical  decision  support  tools  in  other  areas  of  medi¬ 
cine  and  have  been  found  to  improve  clinical  processes. 
Skepticism  exists  about  developing  such  an  algorithm  for  post¬ 
hospitalization  suicide  interventions  based  on  the  relatively 
weak  associations  found  in  previous  research^^  on  in- 
hospital  predictors  and  subsequent  suicides.  However,  a  stron¬ 
ger  risk  algorithm  might  be  developed  in  the  US  Army  be¬ 
cause  of  the  availability  of  integrated  administrative  data  for 
all  US  Army  personnel.  Absence  of  such  data  in  the  general 
population  is  widely  recognized  as  an  impediment  to  big  data 
health  care  solutions.^®  A  number  of  empirical  studies^^'^®  have 


documented  strong  predictive  associations  between  inte¬ 
grated  US  Army  and  DoD  administrative  data  and  subsequent 
US  Army  suicides,  although  none  attempted  to  develop  a  risk 
algorithm  for  posthospitalization  suicides.  The  objective  of  this 
study  was  to  develop  such  an  algorithm  using  administrative 
data  from  the  Historical  Administrative  Data  System  (HADS) 
of  the  Army  Study  to  Assess  Risk  and  Resilience  in  Service- 
members  (Army  STARRS).^"^ 


Methods 

Sample 

Creation  and  analysis  of  the  consolidated  and  deidentified  data 
system  were  approved  by  the  Human  Subjects  Committees  of 
the  Uniformed  Services  University  of  the  Health  Sciences  for 
the  Henry  M.  Jackson  Foundation  (the  primary  grantee),  the 
University  of  Michigan  Institute  for  Social  Research  (site  of  the 
Army  STARRS  Data  Enclave),  and  Harvard  Medical  School  (site 
of  data  analysis).  Obtaining  informed  consent  from  indi¬ 
vidual  soldiers,  most  of  whom  were  no  longer  in  service  at  the 
time  the  HADS  was  constructed,  was  not  required  because  the 
data  were  deidentified. 

There  were  53  769  regular  US  Army  hospitalizations  from 
January  1,  2004,  through  December  31,  2009,  with  any  Inter¬ 
national  Classification  of  Diseases,  Ninth  Revision,  Clinical  Modi¬ 
fication  (ICD-9-CM)  psychiatric  admission  diagnosis  exclu¬ 
sive  of  tobacco  use  disorders  (eTable  1  at  http://www.armystarrs 
.org/publications).  These  hospitalizations  involved  40  820 
soldiers  (30  763  with  1  hospitalization,  6929  with  2,  and  3128 
with  >2),  representing  0.9%  of  all  regular  US  Army  soldiers  in 
any  12-month  period.  We  excluded  the  13  936  additional  hos¬ 
pitalizations  in  which  nicotine  dependence  was  the  only  psy¬ 
chiatric  diagnosis  because  these  were  invariably  for  physical 
disorders  and  nicotine  dependence  was  noted  based  on  with¬ 
drawal  during  hospitalization.  There  was  no  elevated  post¬ 
hospitalization  suicide  risk  among  these  soldiers.  We  also  ex¬ 
cluded  the  406  additional  hospitalizations  that  occurred 
through  emergency  departments  because  of  a  suicide  at¬ 
tempt  without  an  accompanying  ICD-9-CM  psychiatric  diag¬ 
nosis.  Four  of  these  406  soldiers  died  in  the  hospital,  whereas 
none  of  the  others  died  by  suicide  in  the  next  12  months.  On 
the  basis  of  evidence  from  another  study^®  indicating  that  pre¬ 
dictors  of  posthospitalization  suicide  vary  with  time  since  dis¬ 
charge  and  elevated  risk  persists  12  months  after  discharge,  a 
discrete-time  person-month  survival  file  was  created  to  ex¬ 
amine  suicides  in  the  12  months  after  hospital  discharge,  cen¬ 
soring  all  person-months  at  the  beginning  of  new  hospitaliza¬ 
tions  or  terminations  of  active  duty  and  allowing  interactions 
between  substantive  predictors  and  time  since  hospital  dis¬ 
charge.  All  person-months  with  suicide  were  coded  1  on  the 
outcome,  and  all  others  were  coded  0.  This  file  contained 
334  936  person-months  for  a  mean  of  6.2  months  (334  936  per 
53  760  months)  after  hospital  discharge.  This  low  mean  re¬ 
flects  high  rates  of  termination  of  service  and  subsequent  hos¬ 
pitalization  within  12  months  of  each  hospitalization. 
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Measures 

The  HADS  includes  data  from  38  US  Army  and  DoD  adminis¬ 
trative  data  systems^®  (eTable  2  at  http://www.armystarrs.org 
/publications).  In  a  comprehensive  review  of  published  stud¬ 
ies  of  predictors  of  civilian  posthospitalization  suicides,  Troister 
et  aP^  found  5  replicated  classes  of  predictors:  (1)  sociodemo¬ 
graphics  (the  most  consistent  being  male  sex  and  recent  job 
loss),  (2)  history  of  prior  suicidal  behaviors,  (3)  quality  of  care 
(eg,  low  continuity  of  care),  (4)  time  since  hospital  discharge 
(inversely  related  to  suicide  risk),  and  (5)  other  psychopatho- 
logical  risk  factors  (the  most  consistent  being  nonaffective  psy¬ 
chosis,  mood  disorders,  and  multiple  comorbid  psychiatric  dis¬ 
orders).  Other  studies'^-^®’^^  found  similar  predictors.  We 
extracted  HADS  variables  operationalizing  these  predictors  and 
added  US  Army  career  variables  found  to  predict  military 
suicides,^®'^^  unit  variables,  criminal  justice  variables  (vio¬ 
lent  crime  victimization  or  perpetration),  and  measures  of  reg¬ 
istered  weapons.  All  predictors  other  than  those  that  in¬ 
volved  the  hospitalization  were  defined  as  of  the  month  before 
hospitalization,  whereas  predicted  suicides  were  in  the  12 
months  after  hospital  discharge. 

We  cast  a  wide  net  in  extracting  HADS  measures  of  the  pre¬ 
dictor  constructs.  For  example,  we  distinguished  23  catego¬ 
ries  of  psychiatric  diagnoses  defined  largely  by  aggregated  JCD- 
9-CM  codes  (eg,  attention-deficit/hyperactivity  learning 
disorders  [ICD-g-CM  codes  314. 0-315.9]),  8  additional  catego¬ 
ries  of  behavioral  stressors  (eg,  marital  problems,  other  stress¬ 
ors  or  adversities,  suicidal  ideation,  and  self-damaging  behav¬ 
ior),  and  summary  measures  of  any  prior  admission  diagnoses, 
admission  count  variables,  and  parallel  outpatient  variables 
(eTable  1  at  http://www.armystarrs.org/publications).  We  also 
included  National  Drug  Code  psychotropic  medication  codes 
collapsed  into  15  categories  (eg,  antianxiety,  antidepressant, 
and  antipsychotic)  and  25  subcategories  (eg,  selective  seroto¬ 
nin  reuptake  inhibitor,  serotonin-norepinephrine  reuptake  in¬ 
hibitor,  and  tricyclic  antidepressant)  based  on  the  First  Data¬ 
bank  Enhanced  Therapeutic  Classification  System  (http://www 
.fdbhealth.com)  (eTable  3  at  http://www.armystarrs.org 
/publications).  A  total  of  421  individual  variables  were 
constructed  (eTable  4  at  http://www.armystarrs.org 
/publications). 

Because  the  HADS  data  systems  were  not  developed  for 
research,  more  data  were  missing  and  inconsistent  in  some  (eg, 
sociodemographic)  component  data  sets  than  in  research  data 
sets.  However,  because  the  HADS  data  sets  are  updated 
monthly,  missing  values  typically  appeared  in  earlier  and/or 
later  months,  allowing  nearest  neighbor  imputations.  Remain¬ 
ing  missing  values  were  resolved  using  randomly  selected  mul¬ 
tiple  imputations.®®  Inconsistencies  were  reconciled  using  ra¬ 
tional  imputations  (eg,  a  soldier  classified  female  one  month 
but  male  other  months  was  recoded  male). 

Statistical  Analysis 

Discrete-time  (person-month)  survival  analysis®^  was  used  to 
predict  suicides  in  the  12  months  after  hospitalization  in  3  steps. 
First,  functional  forms  of  bivariate  associations  were  exam¬ 
ined  and  predictors  transformed  (usually  sets  of  nested  di¬ 
chotomies  but  some  collapsed-truncated  continuous  vari- 
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ables)  to  explore  nonlinear  multivariate  associations.  Second, 
all  predictors  were  discretized  and  analyzed  with  100  regres¬ 
sion  trees  in  distinct  bootstrap  pseudo-samples  using  the  R 
package  rpart  program®^  to  prevent  overfitting®®  and  allow  de¬ 
tecting  interactions  among  predictors. Third,  predictors 
having  significant  bivariate  associations  and  interactions 
emerging  in  10%  or  more  of  regression  trees  were  included  as 
predictors  in  multivariate  survival  models. 

A  central  challenge  in  the  third  step  was  multicollinearity 
among  the  421  predictors.  The  classic  way  to  address  this  prob¬ 
lem  is  with  stepwise  analysis, ®"^  but  this  approach  overfits.®® 
Machine  learning  methods  reduce  overfitting. ®®’®^  The  ma¬ 
chine  learning  method  we  used  was  the  elastic  net,®®  a  penal¬ 
ized  regression  method  that  provides  stable  and  sparse  esti¬ 
mates  of  model  parameters  by  explicitly  penalizing  overfitting 
with  a  composite  penalty  XfMPP  x  Piasso  +  (1  “  MPP)  x 
where  MPP  is  a  mixing  parameter  penalty  with  values  be¬ 
tween  0  and  1  that  controls  relative  weighting  between  2  types 
of  penalties:  the  lasso  penalty  and  the  ridge  penalty.  The  pa¬ 
rameter  A  controls  the  total  amount  of  penalization.®®  The  ridge 
penalty  handles  multicollinearity  by  shrinking  all  coeffi¬ 
cients  smoothly  toward  0  but  retains  all  variables  in  the 
model.'*®  The  lasso  penalty  allows  simultaneous  coefficient 
shrinkage  and  variable  selection,  tending  to  select  at  most  one 
predictor  in  each  strongly  correlated  set  but  at  the  expense  of 
giving  unstable  estimates  in  the  presence  of  high 
multicollinearity.'**  The  elastic  net  approach  of  combining  the 
ridge  and  lasso  penalties  has  the  advantage  of  yielding  more 
stable  and  accurate  estimates  than  either  the  ridge  or  lasso 
alone  while  maintaining  model  parsimony.®® 

The  3-step  approach  of  combining  regression  trees  with  pe¬ 
nalized  regression  for  variable  selection  enabled  us  to  incor¬ 
porate  possible  interactions  and  nonlinearities  in  a  clinically 
meaningful  way  while  controlling  for  possible  overfitting.  The 
R  package  glmnet  program'*^  was  used  to  estimate  penalized 
models  with  MPPs  of  0.1,  0.4,  0.7,  and  1.0  (an  MPP  of  0.0  was 
not  used  because  of  multicollinearity  in  the  full  predictor  set) . 
Internal  10-fold  cross-validation  selected  the  coefficient  in  front 
of  the  penalty.  Comparative  fit  across  the  20  specifications  (ie, 

4  MPP  values  for  each  of  5  constraints  on  the  number  of  pre¬ 
dictors)  was  evaluated  by  inspecting  the  area  under  the  re¬ 
ceiver  operating  characteristic  curve  (AUC)  and  concentra¬ 
tion  of  risk  (CR) .  The  CR  is  the  proportion  of  observed  suicides 
after  hospitalizations  in  each  ventile  (ie,  20  groups  of  hospi¬ 
talizations  of  equal  frequency)  ordered  by  predicted  suicide 
risk.  Suicide  risk  of  each  hospitalization  was  calculated  using 
coefficients  to  project  risk  as  of  12  months  after  hospital  dis¬ 
charge  regardless  of  observed  hospitalization  data  and  cen¬ 
soring  and  standardized  by  time  of  hospitalization  to  adjust 
for  temporal  variation  in  suicide  risk.  Given  that  the  number 
of  hospitalizations  per  ventile  was  much  larger  than  the  num¬ 
ber  of  suicides,  we  focused  on  the  CR  in  the  highest-risk  ven¬ 
tile  in  selecting  the  best  penalized  model. 

Once  a  best  penalized  model  was  selected,  a  conven¬ 
tional  discrete-time  survival  model  with  a  logistic  link  func¬ 
tion  was  estimated  using  the  same  predictors  as  the  best  pe¬ 
nalized  model  to  examine  how  much  the  penalty  reduced 
model  fit.  Because  the  variance  inflation  factor  of  coeffi- 
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dents  in  this  model  revealed  estimates  to  be  unstable,  we  also 
used  forward  stepwise  analysis  with  a  .05-level  entry  crite¬ 
rion  to  select  a  stable  subset  of  predictors  for  a  reduced  ver¬ 
sion  of  the  logistic  model.  Coefficients  in  this  reduced  logis¬ 
tic  model  were  then  exponentiated  to  create  odds  ratios  (ORs) 
for  ease  of  interpretation.  Ventiles  from  the  best  penalized 
model  were  then  collapsed  into  risk  strata  using  the  logic  of 
stratum-specific  likelihood  ratios.'^^  The  CR,  AUC,  and  the  stan¬ 
dardized  (for  amount  of  uncensored  time  observed  after  each 
hospitalization)  suicide  rates  per  100  000  person-years  were 
calculated  for  these  risk  strata.  F  inally,  parallel  rates  of  risk  were 
calculated  for  unintentional  injury  deaths,  attempted  sui¬ 
cides,  and  subsequent  hospitalizations  in  the  same  ventiles  to 
evaluate  other  adverse  outcomes  associated  with  posthospi¬ 
talization  suicide  risk. 


Results 

Patterns  of  Posthospitalization  Suicide 

Sixty-eight  hospitalized  soldiers  died  by  suicide  within  12 
months  of  hospital  discharge  (263.9  suicides  per  100  000  per¬ 
son-years  vs  18.5  suicides  per  100  000  in  the  total  US  Army),^^ 
representing  12.0%  of  all  US  Army  suicides.  An  additional  157 
hospitalized  soldiers  died  in  other  ways,  and  22  010  others  ter¬ 
minated  active  duty  for  other  reasons  (eg,  administrative  sepa¬ 
ration  and  retirement)  within  12  months  of  hospital  dis¬ 
charge. 

Bivariate  Associations  of  Predictors  With  Suicide 

No  interactions  emerged  in  more  than  10%  of  regression  trees. 
However,  131  of  the  421  bivariate  associations  (31.1%)  be¬ 
tween  individual  predictors  and  suicides  were  significant  at 
the  .05  level  (eTables  5-9  and  eTables  11-15  at  http://www 
.armystarrs  .org/publications) .  All  these  variables  were  used  in 
the  penalized  multivariate  models. 

Selecting  a  Best  Penalized  Survival  Model 

A  10-fold  cross-validation  revealed  that  AUC  was  maximized 
across  the  20  penalized  survival  models  for  an  MPP  of  1.0  (lasso) 
with  73  predictors  and  an  MPP  of  0.1  to  0.7  with  72  to  122  pre¬ 
dictors  (Figure  1).  Because  the  lasso  model  yielded  the  best 
cross-validated  CR  in  the  highest-risk  ventile  (52 .9%)  (Table  1), 
we  estimated  a  conventional  discrete- time  survival  model  with 
a  logistic  link  function  using  the  same  73  predictors.  This  model 
had  a  much  higher  AUC  (AUC,  0.89)  and  CR  (CR,  61.8%)  in  the 
highest-risk  ventile  than  the  lasso  model  with  the  same  pre¬ 
dictors,  but  this  was  because  of  overfitting  (variance  inflation 
factor  >5  for  6  coefficients).  Forward  stepwise  analysis  se¬ 
lected  a  more  stable  set  of  predictors  in  a  reduced  logistic 
model,  and  this  model,  which  contained  20  predictors,  had  a 
slightly  lower  AUC  (AUC,  0.84)  and  CR  (CR,  50.0%)  in  the  high¬ 
est-risk  ventile  than  the  lasso  model. 

Caution  is  needed  in  interpreting  predictors  in  the  re¬ 
duced  logistic  model  because  the  variable  selection  algo¬ 
rithm  maximized  overall  prediction  accuracy  rather  than  in¬ 
dividual  coefficient  accuracy.  It  is  nonetheless  noteworthy  that 
the  model  included  variables  in  all  predictor  classes  (Table  2): 


Figure  1.  Receiver  Operating  Characteristic  (ROC)  Curves  for  Discrete- 
Time  (Person-Month)  Elastic  Net  Penalized  Survival  Models  With 
Different  Mixing  Parameter  Penalties  (MPPs)  and  for  a  Conventional 
Discrete-Time  Survival  Model  Predicting  Posthospitalization  Suicide 
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Elastic  net  penalized  survival  models  were  estimated  with  different  MPPs, 
allowing  up  to  421  predictors.  The  best  cross-validated  model  was  an  MPP  of  1.0 
with  73  predictors.  A  conventional  discrete-time  survival  model  that  contained 
the  same  73  predictors  was  unstable  (variance  inflation  factor  >5.0  for  6 
predictors).  As  a  result,  we  used  forward  stepwise  analysis  with  a  .05-level  entry 
criterion  to  select  a  more  stable  subset  of  the  73  predictors.  Twenty  predictors 
entered  that  model.  The  ROC  curve  shown  here  for  the  conventional  model  is 
based  on  those  20  predictors.  AUC  indicates  area  under  the  receiver  operating 
characteristic  curve. 


3  sociodemographic  characteristics  (male  sex,  enlistment  at  >27 
years  of  age,  and  US  Armed  Forces  Qualification  Test  score 
>50th  percentile;  ORs,  1.9  [95%  Cl,  I.O-3.5]  to  7.9  [95%  Cl,  1.9- 
32.6]),  access  to  firearms  (number  of  registered  pistols;  OR,  1.3; 
95%  Cl,  1.0-1.6),  crime  perpetration  (weapons  possession  or  ver¬ 
bal  assault;  ORs,  2.2  [95%  Cl,  I.2-4.O]  to  5.6  [95% Cl,  1.7-18.3]), 
prior  suicidality  (ORs,  1.6  [95%  Cl,  I.I-2.5]  to  2.9  [95%  Cl,  1.7- 
4-9]),  prior  psychiatric  treatment  (ORs,  0.3  [95%  Cl,  0.2-0.6]  to 
5.6  [95%  Cl,  1.8-17.7]),  and  characteristics  of  the  focal  hospi¬ 
talization  (ORs,  0.4  [95%  Cl,  O.2-O.7]  to  6.0  [95%  Cl,  2.1-17.4]). 
The  2  ORs  less  than  1.0  were  for  (1)  being  above  the  50th  per¬ 
centile  on  the  ratio  of  number  of  psychiatric  hospitalizations 
to  time  in  service  and  (2)  posttraumatic  stress  disorder  dur¬ 
ing  current  hospitalization. 

CR  and  Conditional  Risk  Distributions 

Inspection  of  the  CR  across  predicted  risk  ventiles  led  to  cre¬ 
ation  of  4  risk  strata.  Most  suicides  occurred  in  the  highest- 
risk  stratum  (which  was  made  up  of  the  5%  of  hospitaliza¬ 
tions  in  the  highest-risk  ventile;  CR,  52.9%)  (Figure  2).  The  CR 
was  lower  (CR,  8.8%)  in  the  second  stratum  (made  up  of  the 
5%  of  hospitalizations  in  the  second-highest  ventile),  lower  still 
(CR,  4.2%)  in  a  third  stratum  (made  up  of  the  35%  of  hospital¬ 
izations  in  the  next  7  ventiles),  and  lowest  (CR,  0.8%)  in  the 
fourth  stratum  (made  up  of  the  55%  of  suicides  in  the  lowest 
11  ventiles). 


MPP  of  0.1  with  122  predictors 
(AUC,  0.83) 

MPP  of  0.4  with  89  predictors 
(AUC.  0.85) 

MPP  of  0.7  with  72  predictors 
(AUC.  0.85) 

MPP  of  1.0  with  73  predictors 
(AUC.  0.85) 

Conventional  model  with  20 
predictors  (AUC,  0.84) 


0.4  0.5  0.6 

l-Specificity 
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Table  1.  CR,  AUC,  and  Np  Values  by  Mixing  Parameter  Penalty® 

j  Mixing  Parameter  Penalty 

Allowed  - = - - - 


Predictor 

0.1 

0.4 

0.7 

1.0 

25 

CR 

26.5 

29.4 

35.3 

36.8 

AUC 

0.71 

0.75 

0.77 

0.79 

% 

30 

27 

26 

30 

50 

CR 

29.4 

41.2 

42.6 

50.0 

AUC 

0.74 

0.80 

0.82 

0.84 

rip 

53 

51 

53 

56 

100 

CR 

45.6 

51.5 

51.5 

52.9 

AUC 

0.82 

0.85 

0.85 

0.85 

% 

109 

89 

72 

73 

200 

CR 

48.5 

51.5 

51.5 

52.9 

AUC 

0.84 

0.85 

0.85 

0.85 

Hp 

122 

89 

72 

73 

421 

CR 

48.5 

51.5 

51.5 

52.9 

AUC 

0.84 

0.85 

0.85 

0.85 

Hp 

122 

89 

72 

73 

Abbreviations:  AUC,  area  under  the  receiver  operating  characteristic  curve;  CR, 
concentration  of  risk;  Op,  number  of  selected  predictors. 

®  The  CR  is  the  proportion  of  all  observed  posthospitalization  suicides  that 
occurred  in  the  12  months  after  hospital  discharge  (or  <12  months  if  the  soldier 
terminated  services  before  12  months  after  hospital  discharge)  that  occurred 
after  the  5%  of  hospitalizations  classified  by  the  model  as  having  highest  risk 
of  suicide.  See  the  Statistical  Analysis  section  for  a  discussion  of  elastic  net 
models  and  mixing  parameter  penalties. 


Suicide  risk  ranged  from  1338.8  per  100  000  hospitaliza¬ 
tions  in  the  highest-risk  stratum  to  20.3  per  100  000  hospital¬ 
izations  in  the  lowest-risk  stratum  (Table  3).  However,  be¬ 
cause  mean  time  in  service  after  hospital  discharge  was 
considerably  less  than  12  months,  suicide  risk  per  100  000  per¬ 
son-years  was  considerably  higher  than  per  100  000  hospital¬ 
izations:  3824.1  per  100  000  person-years  in  the  highest-risk 
stratum  to  40.9  per  100  000  in  the  lowest-risk  stratum. 

Stability  of  Estimates 

The  CR  in  the  highest-risk  stratum  did  not  differ  signifi¬ 
cantly,  depending  on  whether  (1)  hospitalization  was  in  a  fa¬ 
cility  with  a  mental  health  inpatient  unit  vs  a  general  medical 
facility  without  such  a  unit  (48.2%  vs  66.7;  X?  =  1.7;  f  =  .19);  (2) 
the  suicide  occurred  before  vs  after  September  1,  2008  (me¬ 
dian  date  of  suicides  during  the  study  period;  38.7%  vs  70.3%; 
X?  =  2.4;  P  =  .12);  or  (3)  the  suicide  did  vs  did  not  occur  within 
3  months  of  hospital  discharge  (median  time  to  postdis¬ 
charge  suicide;  52.6%  vs  56.7%;  x?  =  0.0;  P  =  .99). 

Associations  of  Suicide  Risk  With  Other  Adverse  Outcomes 

Soldiers  in  the  highest-risk  stratum  also  had  elevated  risks  of 
other  adverse  outcomes  in  the  year  after  hospital  discharge, 
including  unintentional  injury  deaths  (CR,  10.1%;  Xi  =  7-1; 
P  =  .008),  suicide  attempts  (CR,  9.1%;  x?  =  332.7;  P  <  .001),  and 

jamapsychiatry.com 


Table  2.  ORs  (95%  CIs)  and  VIFs  for  the  Discrete-Time  Logistic  Survival 
Model^ 


Variable 

OR  (95%  Cl) 

VIF'’ 

Sociodemographics 

Male  sex  (yes/no) 

7.9  (1.9-32.6)' 

1.0 

Age  of  enlistment  >27  y  (yes/no) 

1.9  (1.0-3. 5)' 

1.0 

AFQT  score  >50th  percentile  (yes/no) 

3.3  (1.7-10.0)' 

1.0 

Access  to  firearms 

No.  of  registered  pistols 

1.3  (1.0-1.6)' 

1.0 

Crime  perpetration 

No.  of  verbal  assault  offenses  in  past  12 
mo 

2.2  (1. 2-4.0)' 

1.0 

Any  nonviolent  weapons  offense  in  past 

24  mo  (yes/no) 

5.6  (1.7-18.3)' 

1.0 

Suicidal  behavior 

Any  prior  suicide  attempt  since  enlistment 
(yes/no) 

2.9  (1. 7-4.9)' 

1.0 

No.  of  outpatient  visits  with  suicidal 
ideation  in  past  12  mo 

1.6  (1.1-2. 5)' 

1.1 

Other  prior  treatment 

>6  Outpatient  visits  with  a  mental  health 
professional  in  past  12  mo  (yes/no) 

1.9  (1.0-3.6)' 

1.4 

No.  of  antidepressant  prescriptions  filled 
in  past  12  mo 

1.3  (1. 1-1.7)' 

1.1 

No.  of  psychiatric  hospitalizations/time  in 
service  >50%  percentile  (yes/no) 

0.3  (0.2-0.6)' 

1.2 

Any  prior  inpatient  psychiatric  treatment 

1.8  (0.8-3.7) 

1.8 

in  past  12  mo  (yes/no) 

No.  of  inpatient  days  in  past  12  mo  by 
diagnosis 


Major  depression 

2.2  (1. 1-4.4)' 

1.4 

Somatoform  or  dissociative  disorder 

5.6  (1.8-17.7)' 

1.0 

Characteristics  of  focal  hospitalization 

Hospitalized  in  a  civilian  psychiatric 
hospital  or  civilian  facility  with  a 
psychiatric  unit  (yes/no) 

1.6  (1.0-2.7)' 

1.0 

Disorders  diagnosed  during  current 
hospitalization  (yes/no) 

PTSD 

0.4  (0.2-0.7)' 

1.1 

Suicidal  ideation 

2.4(1. 3-4.7)' 

1.0 

Nonaffective  psychosis 

2.9  (1. 2-7.0)' 

1.0 

Somatoform  or  dissociative  disorder 

3.6  (1.2-10.8)' 

1.0 

Hearing  loss 

6.0  (2.1-17.4)' 

1.0 

Abbreviations:  AFQT,  US  Armed  Forces  Qualification  Test;  OR,  odds  ratio;  PTSD, 
posttraumatic  stress  disorder;  VI F,  variance  inflation  factor. 

^  The  best  penalized  survival  model  was  a  lasso  model  with  73  predictors  from 
the  total  of  421  predictors  considered.  A  conventional  discrete-time  survival 
model  that  contained  those  same  73  predictors  was  unstable  (VIF  >5.0  for  6 
predictors).  As  a  result,  we  used  forward  stepwise  analysis  with  a  .05-level 
entry  criterion  to  select  a  more  stable  subset  of  the  73  predictors.  The 
coefficients  for  the  20  predictors  that  entered  are  presented  here. 

^  The  VIF  for  the  coefficient  associated  with  predictor  Xj  in  the  above  equation 
equals  1/(1  where  R^i  is  the  coefficient  of  determination  of  a  regression 
equation  in  which  X|  is  the  dependent  variable,  and  all  the  other  19  predictors 
of  suicide  are  included  as  predictors  of  Xj.  A  VIF  greater  than  5.0  is  typically 
considered  an  indicator  of  high  multicollinearity.'^ 

Significant  at  the  .05  level  (2-sided  test).  However,  note  that  the  predictors 
were  selected  using  stepwise  analysis  and  the  current  P  values  are 
consequently  inexact. 


subsequent  hospitalizations  (7-5%;  Xi  =  893-4;  P  <  .001).  Sol¬ 
diers  in  the  highest  predicted  suicide  risk  stratum  had  7  un¬ 
intentional  injury  deaths,  830  suicide  attempts,  and  3765  sub¬ 
sequent  hospitalizations  within  12  months  of  hospital  discharge 
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Figure  2.  Concentration  of  Risk  of  Posthospitalization  Suicides  by  Ventile  of  Predicted  Risk  Based  on  the 
Discrete-Time  Penalized  Survival  Model  With  a  Mixing  Parameter  Penalty  of  1.0 


Ventile  of  Predicted  Risk  hospitalizations  into  groups  defined 

_  by  level  of  predicted  suicide  risk. 


(492,666.2  per  100  000  person-years).  At  least  one  of  these  out¬ 
comes  occurred  after  46.3%  of  the  highest-risk  hospitaliza¬ 
tions. 


Discussion 

Although  risk  factors  for  suicide  are  widely  known,  synthe¬ 
sizing  this  information  to  optimize  suicide  prediction  has  been 
an  elusive  goal  up  to  now.  This  study  addressed  this  problem 
by  using  machine  learning  to  generate  an  actuarial  suicide  risk 
algorithm  from  US  Army  and  DoD  administrative  data,  find¬ 
ing  that  52.9%  of  suicides  occurred  after  the  5%  of  hospital¬ 
izations  with  highest  predicted  risk.  Although  interventions 
in  this  high-risk  stratum  would  not  solve  the  entire  US  Army 
suicide  problem  given  that  posthospitalization  suicides  ac¬ 
count  for  only  12%  of  all  US  Army  suicides,  the  algorithm  would 


presumably  help  target  preventive  interventions .  Before  clini¬ 
cal  implementation,  though,  several  key  issues  must  be  ad¬ 
dressed. 

The  first  question  is  whether  the  risk  algorithm  is  suffi¬ 
ciently  stable  to  predict  future  suicides  given  that  it  is  based 
on  only  68  prior  suicides.  It  is  noteworthy  that  the  machine 
learning  methods  used  to  create  the  algorithm  were  designed 
explicitly  to  maximize  stability  of  predictions.  Within- 
sample  stability  analyses  found  that  the  CR  did  not  vary  sig¬ 
nificantly  by  type  of  inpatient  facility,  year  of  hospitalization, 
or  number  of  months  since  hospital  discharge;  however,  this 
does  not  guarantee  future  stability.  Algorithm  stability  will  con¬ 
sequently  be  tested  again  in  the  2010-2013  US  Army  suicide  data 
in  a  future  study  to  address  this  question. 

The  second  question  is  whether  the  risk  algorithm  im¬ 
proves  on  clinical  judgment.  The  study  was  unable  to  exam¬ 
ine  this  issue  empirically  because  the  US  Army  electronic  medi- 


Table  3.  CR  and  Conditional  Risk  of  Posthospitalization  Suicides  by  Risk  Strata  Across  All  Hospitalizations 


Strata  of  Predicted  Suicide  Risk  Based  on  the  Lasso  Model^ 

Variable 

Highest-Risk  Stratum 
(First  Ventile) 

Second  Ventile 

Third  to  Ninth 
Ventiles 

Lowest-Risk  Stratum 
(10th-20th  Ventiles) 

Total 

Observed  No.  of  suicides 

36 

6 

20 

6 

68 

CR,  %'‘ 

52.9 

8.8 

4.2 

0.9 

NA 

No.  per  100  000  person-years 

Hospitalizations 

1338.8 

223.3 

106.3 

20.3 

126.5 

Person-years 

3824.1 

538.7 

221.1 

40.9 

263.9 

No.  of  hospitalizations 

2689 

2687 

18  820 

29  573 

53  769 

Abbreviations;  CR,  concentration  of  risk;  NA,  not  applicable. 

•^The  CR.  which  is  defined  as  the  proportion  of  all  the  observed  outcomes  of  the 

“  Ventiles  of  suicide  risk  are  20  groups  of  hospitalizations  of  equal  frequency 
(n  =  2688-2689  hospitalizations)  dividing  the  total  of  53769  hospitalizations 
into  groups  defined  by  level  of  predicted  suicide  risk.  The  third  through  ninth 
ventiles  were  collapsed  into  a  single  risk  stratum  based  on  the  fact  that 
observed  suicide  risk  was  comparable  in  these  7  ventiles.  The  10th  through 
20th  ventiles  were  collapsed  into  a  final  risk  stratum  based  on  similar 
evidence. 


type  that  occurred  in  the  12  months  after  hospital  discharge  (or  <12  months  if 
the  soldier  terminated  services  before  12  months  after  hospital  discharge)  that 
occurred  in  the  risk  ventile  represented  by  the  column  heading.  The  CR  is 
defined  separately  for  each  of  the  2  highest  risl<  ventiles  and  then  as  a 
per-ventile  mean  for  the  next  7  ventiles  treated  as  a  single  risk  stratum  and 
then  final  11  ventiles  treated  as  a  separate  risk  stratum. 
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cal  record  does  not  include  a  structured  field  where  health  care 
professionals  must  record  suicide  risk  assessments.  In  addi¬ 
tion,  documentation  of  suicide  risk  assessment  in  clinical  notes 
was  not  consistent  during  the  study  period.  However,  with  im¬ 
proved  documentation  after  the  VA  and  DoD  CPG,  compari¬ 
son  of  actuarial  to  clinical  prediction  may  be  possible  in  the 
future.  As  noted  in  the  Introduction,  though,  previous  re¬ 
search  has  indicated  that  actuarial  suicide  prediction  is  much 
more  accurate  than  prediction  based  on  clinical  judgment.^^'^'* 
This  evidence  is  consistent  with  a  large  body  of  literature  re¬ 
porting  that  actuarial  methods  are  superior  to  expert  judg¬ 
ments  in  many  areas  of  prediction.'^^-'^®  At  the  same  time,  the 
comprehensive  suicide  risk  assessments  required  by  the  new 
VA  and  DoD  CPG“  will  generate  information  not  included  in 
administrative  records.  As  a  result,  our  algorithm  should  be 
seen  as  a  component  of  this  comprehensive  clinical  assess¬ 
ment  rather  than  a  substitute  for  this  assessment. 

The  third  question  is  whether  suicide  is  sufficiently  com¬ 
mon  in  the  highest-risk  stratum  and  available  interventions  suf¬ 
ficiently  powerful  to  make  targeted  posthospitalization  inter¬ 
ventions  efficient  compared  with  alternative  ways  of  deploying 
the  same  clinical  resources.  Our  results  shed  no  light  on  this 
question.  The  potential  for  harm  also  has  to  be  taken  into  con¬ 
sideration  because  intensive  posthospitalization  interven¬ 
tions  might  lead  to  undue  scrutiny  by  nonmedical  leaders  that 
adversely  affect  soldier  careers.  This  concern  is  all  the  more 
important  given  that  most  soldiers  identified  as  being  high  risk 
do  not  commit  suicide.  Although  a  formal  analysis  of  com¬ 
parative  risks  and  benefits  is  beyond  the  scope  of  this  report, 
it  is  noteworthy  that  the  highest-risk  stratum  had  signifi¬ 
cantly  elevated  risks  of  other  adverse  outcomes  and  that  preva¬ 
lence  of  at  least  one  such  outcome  was  present  after  46.3%  of 
highest-risk  hospitalizations.  Ameliorative  effects  of  ex¬ 


panded  high-risk  interventions  on  these  outcomes  (ie,  unin¬ 
tentional  injury  deaths,  suicide  attempts,  and  subsequent  hos¬ 
pitalizations)  are  plausible  because  numerous  risk  factors  for 
suicide  (eg,  depression  and  substance  abuse)  are  also  risk  fac¬ 
tors  for  these  other  outcomes^-^'^-'^®  and  most  suicide  preven¬ 
tion  interventions  recommended  for  high-risk  patients  are 
likely  to  affect  these  outcomes  as  well.^’“  These  presumed  ben¬ 
efits  would  have  to  be  considered  in  a  broad-based  evalua¬ 
tion  of  risks  and  benefits  of  any  future  targeted  high-risk  post¬ 
hospitalization  preventive  interventions. 

The  major  limitations  of  our  analysis  involve  errors  in  the 
administrative  data  used  as  predictors  (missing  and  inconsis¬ 
tent  values  and  errors  inICD-9-CM diagnoses).  In  addition,  the 
algorithm  could  almost  certainly  be  improved  if  more  nu- 
anced  risk  factor  data  were  available.  Because  the  new  VA  and 
DoD  CPG  contains  a  checklist  of  risk  factors  health  care  pro¬ 
fessionals  are  urged  to  assess  in  evaluating  suicide  risk,  cre¬ 
ation  of  a  system  to  record  these  assessments  in  the  elec¬ 
tronic  medical  record  along  with  the  health  care  professional’s 
clinical  global  impression  of  patient  suicide  risk  might  in¬ 
crease  the  completeness  of  these  assessments  and  provide  a 
rich  source  of  information  for  future  risk  algorithm  refine¬ 
ment. 


Conclusions 

The  high  concentration  of  risk  of  suicides  and  other  adverse 
outcomes  might  justify  targeting  expanded  posthospitaliza¬ 
tion  interventions  to  soldiers  classified  as  having  highest  post¬ 
hospitalization  suicide  risk,  although  final  determination  re¬ 
quires  careful  consideration  of  intervention  costs,  comparative 
effectiveness,  and  possible  adverse  effects. 
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McNeil  Janssen  Scientific  Affairs,  Johnson  & 
Johnson,  Plus  One  Health  Management,  and 
Wyeth-Ayerst  and  receiving  research  support  for  his 
epidemiologic  studies  from  Analysis  Group  Inc. 
Bristol-Myers  Squibb,  Eli  Lilly  &  Company,  EPI-Q, 
GlaxoSmithKline,  Johnson  &  Johnson 
Pharmaceuticals,  Ortho-McNeil  Janssen  Scientific 
Affairs,  Pfizer  Inc,  Sanofi-Aventis  Groupe  SA,  Shire 
US  Inc.  and  Walgreens  Co.  Dr  Kessler  reported 
owing  a  25%  share  in  DataStat  Inc.  Dr  Stein 
reported  being  a  consultant  for  Healthcare 
Management  Technologies  and  receiving  research 
support  for  pharmacologic  imaging  studies  from 
Janssen.  No  other  disclosures  were  reported. 

Funding/Support:  The  Army  STARRS  was 
sponsored  by  the  US  Department  of  the  Army  and 
funded  under  cooperative  agreement 
U01MH087981  with  the  National  Institute  of 
Mental  Health,  National  Institutes  of  Health,  US 
Department  of  Health  and  Human  Services. 

Role  of  the  Funder/Sponsor:  As  a  cooperative 
agreement,  scientists  employed  by  the  National 
Institute  of  Mental  Health  (Drs  Coipe  and 
Schoenbaum)  and  US  Army  liaisons  and  consultants 
(Dr  Cox  and  Steven  Cersovsky,  MD,  MPH) 
collaborated  to  develop  the  study  protocol  and 
data  collection  instruments,  supervise  data 
collection,  interpret  results,  and  prepare  reports. 
Although  a  draft  of  the  manuscript  was  submitted 
to  the  US  Army  and  the  National  Institute  of  Mental 
Health  for  review  and  comment  before  submission, 
this  was  with  the  understanding  that  comments 
would  be  only  advisory. 

Group  Information:  The  Army  STARRS  coprincipal 
investigators  were  Robert  J.  Ursano,  MD 
(Uniformed  Services  University  of  the  Health 
Sciences),  and  Murray  B.  Stein,  MD,  MPH 
(University  of  California,  San  Diego,  and  Veterans 
Affairs  San  Diego  Healthcare  System).  The  site 
principal  investigators  were  Steven  G.  Heeringa, 

PhD  (University  of  Michigan),  and  Ronald  C.  Kessler. 
PhD  (Harvard  Medical  School).  The  National 
Institute  of  Mental  Health  collaborating  scientists 
were  Lisa  J.  Coipe,  PhD,  MPH.  and  Michael 
Schoenbaum,  PhD.  The  US  Army  liaisons  and 
consultants  were  Steven  Cersovsky,  MD,  MPH,  and 
Kenneth  L.  Cox,  MD.  MPH. 

Other  team  members  were  Pablo  A.  Aliaga,  MA 
(Uniformed  Services  University  of  the  Health 
Sciences),  David  M.  Benedek,  MD  (Uniformed 
Services  University  of  the  Health  Sciences),  Susan 
Borja,  PhD  (National  Institute  of  Mental  Health), 
Gregory  G.  Brown,  PhD  (University  of  California, 

San  Diego),  Laura  Campbell-Sills,  PhD  (University  of 
California,  San  Diego),  Catherine  L.  Dempsey,  PhD, 
MPH  (Uniformed  Services  University  of  the  Health 
Sciences),  Richard  Frank,  PhD  (Harvard  Medical 
School),  Carol  S.  Fullerton,  PhD  (Uniformed 
Services  University  of  the  Health  Sciences),  Nancy 
Gebler,  MA  (University  of  Michigan),  Robert  K. 
Gifford.  PhD  (Uniformed  Services  University  of  the 
Health  Sciences),  Stephen  E.  Gilman,  ScD  (Harvard 
School  of  Public  Health),  Marjan  G.  Holloway,  PhD 
(Uniformed  Services  University  of  the  Health 
Sciences),  Paul  E.  Hurwitz,  MPH  (Uniformed 
Services  University  of  the  Health  Sciences),  Sonia 
Jain,  PhD  (University  of  California,  San  Diego), 
Tzu-Cheg  Kao,  PhD  (Uniformed  Services  University 
of  the  Health  Sciences),  Karestan  C.  Koenen,  PhD 
(Columbia  University),  Lisa  Lewandowski-Romps, 
PhD  (University  of  Michigan),  Holly  Herberman 


Mash,  PhD  (Uniformed  Services  University  of  the 
Health  Sciences),  James  E.  McCarroll,  PhD,  MPH 
(Uniformed  Services  University  of  the  Health 
Sciences),  Katie  A.  McLaughlin.  PhD  (Harvard 
Medical  School),  James  A.  Naifeh,  PhD  (Uniformed 
Services  University  of  the  Health  Sciences). 
Matthew  K.  Nock,  PhD  (Harvard  University),  Rema 
Raman,  PhD  (University  of  California,  San  Diego), 
Sherri  Rose,  PhD  (Harvard  Medical  School), 

Anthony  Joseph  Rosellini,  PhD  (Harvard  Medical 
School).  Nancy  A.  Sampson,  BA  (Harvard  Medical 
School),  LCDR  Patcho Santiago,  MD,  MPH 
(Uniformed  Services  University  of  the  Health 
Sciences),  Michaelle  Scanlon,  MBA  (National 
Institute  of  Mental  Health),  Jordan  Smoller,  MD, 

ScD  (Harvard  Medical  School),  Michael  L.  Thomas, 
PhD  (University  of  California,  San  Diego),  Patti  L. 
Vegella,  MS,  MA  (Uniformed  Services  University  of 
the  Health  Sciences),  Christina  Wassel,  PhD 
(University  of  Pittsburgh),  and  Alan  M.  Zaslavsky. 
PhD  (Harvard  Medical  School). 

Disclaimer:  The  contents  are  solely  the 
responsibility  of  the  authors  and  do  not  necessarily 
represent  the  views  of  the  US  Department  of 
Health  and  Human  Services,  the  National  Institute 
of  Mental  Health,  the  US  Department  of  the  Army, 
or  the  US  Department  of  Defense. 

Additional  Contributions:  John  Mann,  MD,  Maria 
Oquendo,  MD,  Barbara  Stanley,  PhD,  Kelly  Posner, 
PhD,  and  John  Keilp,  PhD,  Department  of 
Psychiatry,  Columbia  University,  College  of 
Physicians  and  Surgeons,  and  New  York  State 
Psychiatric  Institute.  New  York,  contributed  to  the 
early  stages  of  the  US  Army  STARRS  development. 
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Behavioral-based  predictors  of  workplace  violence  in  the 

Army  STARRS 


PI:  Ronald  C.  Kessler,  PhD  Org:  Harvard  Medical  School 


Study/Product  Aim(s) 


To  develop  practical  risk  prediction  indices  for  workplace  violence 
perpetration  and  victimization  based  on  analyses  of  data  in  the  Army 
Study  to  Assess  Risk  and  Resilience  in  Servicemembers  (A-STARRS).  A- 
STARRS  data  include  administrative  records  from  many  different 
Army/DoD  data-bases,  self-report  survey  data,  neurocognitive  test 
results,  and  blood  samples.  This  large  and  rich  databases  creates  a 
unique  opportunity  to  develop  practical  risk  prediction  indices  for 
workplace  violence  perpetration  and  victimization. 


ARMY 

SX^RRS 

Army  Study  to  Assess  Risk  and  Resilience  in  Servicemembers 


Approach  and  Military  Relevance 

The  preliminary  A-STARRS  data  report  very  high  proportions  of  Soldiers 
who  either  perpetrate  and/or  are  victimized  by  workplace  violence.  The 
results  of  the  proposed  research  could  generate  new  information  about 
the  most  powerful  risk  and  resilience  factors  for  preventing  workplace 
violence  in  the  Army  and  targeting  outreach  efforts  for  high  risk  Soldiers. 


Estimated  Timeline 


Activities  Year 

1 

2 

3 

4 

Build  HADS  survival  files.  Construct 

Basic  models  of  HADS  outcomes 

Add  conceptually-guided  interactions 
&  cross-validated  data-mining  models 

Expand  models  of  HADS  outcomes  with 
survey  and  neurocognitive  predictors 

Develop  &  test  final  workplace  violence 
Perpetration-victimization  indices 

Q8  Report 


Behavioral-based  predictors  of 
workplace  violence  in  the  Army 
STARRS 


Projected  Goals/Milestones 

Year  1:  Due  to  IRB  and  DUA  delays,  work  was  delayed.  Coded  survey  data. 
Developed  a  coding  scheme  to  code  violent  crimes  into  NCRP  categories. 

Year  2:  Disaggregated,  selected,  and  examined  rates  of  administratively  recorded 
workplace  violence  perpetration  and  victimization  in  the  HADS  and  NSS.  Examined 
rates  of  self-reported  workplace  violence  in  the  AAS  and  PPDS.  Constructed  HADS 
and  NSS  predictors.  Built  person-month  data  files  for  HADS  and  NSS  data  mining 
models  for  predicting  first  occurrence  of  workplace  violence  outcomes.  Examined 
bivariate  associations.  Used  cross-validated  data  mining  to  maximize  prediction 
accuracy  and  concentration  of  risk  and  identify  final  predictor  sets  for  1 5  HADS  and 
NSS  outcomes.  Examined  the  relative  strength  of  the  HADS  and  NSS  predictors 
Year  3:  Finalize  HADS  and  NSS  data  mining  models.  Develop  parallel  data  mining 
models  predicting  recurrence  of  workplace  violence  in  the  HADS.  Develop  parallel 
data  mining  models  predicting  first  occurrence  of  self-reported  workplace  violence  in 
the  AAS  and  PPDS.  Continue  analyses  that  distinguish  competing  interpretations  of 
significant  predictors.  Complete  all  data  mining  analyses. 

Year  4:  Carry  out  conceptually-guided  interaction  analyses  based  on  the  stressor- 
emotion  and  rational  choice  models  of  workplace  violence.  Finalize  computer 
programs  to  predict  perpetration  and  victimization.  Finalize  analyses  and 
recommendations  for  optimal  targets  for  interventions. 


